Abstract



Polar codes are constructed for m-user multiple access channels (MAC) whose input alphabet 
size is a prime number. The block error probability under successive cancelation decoding is 
$o(2^{-N^{1/2-\epsilon}})$, where $N$ is the block length. Although the sum capacity is achieved by this coding
scheme, some points in the symmetric capacity region may not be achieved. In the case where 
the channel is a combination of linear channels, we provide a necessary and sufficient condition 
characterizing the channels whose symmetric capacity region is preserved upon the polarization 
process. We also provide a sufficient condition for having a total loss in the dominant face. 

1 Introduction 

Polar coding, invented by Arikan [2], is the first low-complexity coding technique that achieves
the symmetric capacity of binary-input memoryless channels. The idea is to convert a set of
identical copies of a given single-user binary-input channel into a set of almost extremal channels,
i.e. either almost perfect channels or almost useless channels. This phenomenon is called
polarization.

Arikan's technique was generalized in [6] for channels with arbitrary input alphabet size.
The probability of error of successive cancelation decoding of polar codes was proved to be
equal to $o(2^{-N^{1/2-\epsilon}})$ [3].

In the case of multiple access channels, we find two main results in the literature: (i) E.
Şaşoğlu et al. constructed polar codes for the two-user MAC [7], (ii) E. Abbe and E. Telatar
constructed polar codes for the m-user MAC with binary input [1]. In this project, we combine
the ideas of [7] and [1] to construct polar codes for the m-user MAC whose input alphabet size
is a prime number (or a power of a prime number).

In our construction, as well as in both constructions in [7] and [1], the sum capacity is
preserved upon the polarization process, but some points in the symmetric capacity region are
not always achieved by the polar coding scheme: a part of the symmetric capacity region is
lost by polar coding. In this project, we study this loss in the special case where the channel is
a combination of linear channels (this class of channels will be introduced in section 6).

In section 2, we introduce the preliminaries for this project. We describe the polarization 
process in section 3. The rate of polarization is studied in section 4. Polar codes for the m-user 
MAC are constructed in section 5. The problem of loss in the capacity region is studied in 
section 6. Finally, we conclude this project in section 7. 

2 Preliminaries 

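Definition 1. A discrete m-user multiple access channel (MAC) is an (m + 2)-tuple
$P = (\mathcal{X}_1, \mathcal{X}_2, \ldots, \mathcal{X}_m, \mathcal{Y}, f_P)$ where $\mathcal{X}_1, \ldots, \mathcal{X}_m$ are finite sets that are called the "input alphabets" of
P, $\mathcal{Y}$ is a finite set that is called the "output alphabet" of P, and $f_P : \mathcal{X}_1 \times \mathcal{X}_2 \times \cdots \times \mathcal{X}_m \times \mathcal{Y} \to [0, 1]$
is a function satisfying $\sum_{y \in \mathcal{Y}} f_P(x_1, x_2, \ldots, x_m, y) = 1$ for all $(x_1, x_2, \ldots, x_m) \in \mathcal{X}_1 \times \mathcal{X}_2 \times \cdots \times \mathcal{X}_m$.

Notation 1. We write $P : \mathcal{X}_1 \times \mathcal{X}_2 \times \cdots \times \mathcal{X}_m \to \mathcal{Y}$ to denote that P has m users, $\mathcal{X}_1, \mathcal{X}_2, \ldots, \mathcal{X}_m$
as input alphabets, and $\mathcal{Y}$ as output alphabet. We denote $f_P(x_1, x_2, \ldots, x_m, y)$ by $P(y|x_1, x_2, \ldots, x_m)$,
which is interpreted as the conditional probability of getting y at the output, given that we have
$(x_1, x_2, \ldots, x_m)$ at the input.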
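Definition 2. A code C of block length N and rate vector $(R_1, R_2, \ldots, R_m)$ is an (m + 1)-tuple
$C = (f_1, f_2, \ldots, f_m, g)$, where $f_k : \mathcal{W}_k = \{1, 2, \ldots, \lceil a^{N R_k} \rceil\} \to \mathcal{X}_k^N$ is the encoding function of
the $k$-th user and $g : \mathcal{Y}^N \to \mathcal{W}_1 \times \mathcal{W}_2 \times \cdots \times \mathcal{W}_m$ is the decoding function. We denote $f_k(w) =
(f_k(w)_1, \ldots, f_k(w)_N)$, where $f_k(w)_n$ is the $n$-th component of $f_k(w)$. The average probability of
error of the code C is given by:

$$P_e(C) = \frac{1}{|\mathcal{W}_1| \times \cdots \times |\mathcal{W}_m|} \sum_{(w_1, \ldots, w_m) \in \mathcal{W}_1 \times \cdots \times \mathcal{W}_m} P_e(w_1, \ldots, w_m)$$

$$P_e(w_1, \ldots, w_m) = \sum_{\substack{(y_1, \ldots, y_N) \in \mathcal{Y}^N \\ g(y_1, \ldots, y_N) \neq (w_1, \ldots, w_m)}} \prod_{n=1}^{N} P(y_n | f_1(w_1)_n, \ldots, f_m(w_m)_n)$$

where a is a predetermined real number according to which the rate of information is measured.
If a = 2, the rates are expressed in bits, and if a = e, the rates are expressed in nats.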

Definition 3. A rate vector $R = (R_1, \ldots, R_m)$ is said to be achievable if there exists a sequence
of codes $C_N$ of rate vector R and of block length N such that $P_e(C_N)$ tends to zero as N tends
to infinity. The capacity region of the MAC P is the closure of the set of all achievable rate
vectors.

Theorem 1. (Theorem 15.3.6 f^) The capacity region of a MAC P is given by the closure of 
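the convex hull of the rate vectors $(R_1, \ldots, R_m)$ satisfying

$$R(S) \le I[S](P) \quad \text{for all } S \subset \{1, \ldots, m\}$$

for some probability distribution $P(y|x_1, \ldots, x_m) p_1(x_1) \ldots p_m(x_m)$ on $\mathcal{X}_1 \times \cdots \times \mathcal{X}_m \times \mathcal{Y}$, where

$$R(S) := \sum_{k=1}^{l_S} R_{s_k}, \quad X(S) := (X_{s_1}, \ldots, X_{s_{l_S}}) \ \text{for } S = \{s_1, \ldots, s_{l_S}\}, \quad \text{and} \quad I[S](P) := I(X(S); Y X(S^c)).$$

All the mutual informations are calculated using logarithm base a.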

Definition 4. $I(P) := I[\{1, \ldots, m\}](P)$ is called the sum capacity of P; it is equal to the maximum
value of $R_1 + \ldots + R_m$ when $(R_1, \ldots, R_m)$ is achievable. The set of points of the capacity
region satisfying $R_1 + \ldots + R_m = I[\{1, \ldots, m\}](P)$ is called the dominant face.

In this project, we are interested in MACs where $\mathcal{X}_k = \mathbb{F}_q$ and we take a = q. We will focus
our attention on the case where q is a prime number, since we can easily generalize to the more
general case where q is a power of a prime number. More particularly, we are interested in the
symmetric capacity region, which is defined by:

$$\mathcal{J}(P) = \{ (R_1, \ldots, R_m) : 0 \le R(S) \le I(X(S); Y X(S^c)) \ \ \forall S \subset \{1, \ldots, m\} \}$$

where the mutual informations are calculated for independently, uniformly distributed $X_1, \ldots, X_m$.
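
As an illustration of the quantities $I[S](P)$ that bound the symmetric capacity region, the following minimal sketch evaluates $I(X(S); Y X(S^c))$ by brute force for uniform, independent inputs. The function name `symmetric_mi`, the dictionary-based channel representation, and the example channel `adder_mac` (a noiseless binary adder, $Y = X_1 + X_2$ over the integers) are assumptions made for this example and do not come from the text; users are indexed from 0 in the code.

```python
from collections import defaultdict
from itertools import product
from math import log


def symmetric_mi(channel, q, m, S):
    """Return I(X(S); Y, X(S^c)) in base-q units for uniform independent inputs.

    `channel(x)` is a hypothetical interface: it takes an input tuple x in
    {0..q-1}^m and returns a dict {y: P(y|x)}.
    """
    S = sorted(S)
    Sc = [k for k in range(m) if k not in S]
    p_x = 1.0 / q ** m
    joint = defaultdict(float)  # joint law of (X(S), (Y, X(S^c)))
    for x in product(range(q), repeat=m):
        a = tuple(x[k] for k in S)
        for y, p in channel(x).items():
            z = (y, tuple(x[k] for k in Sc))
            joint[(a, z)] += p_x * p
    p_a, p_z = defaultdict(float), defaultdict(float)
    for (a, z), p in joint.items():
        p_a[a] += p
        p_z[z] += p
    return sum(p * log(p / (p_a[a] * p_z[z]), q)
               for (a, z), p in joint.items() if p > 0)


def adder_mac(x):
    """Noiseless adder MAC (illustrative): the output is the integer sum of the inputs."""
    return {sum(x): 1.0}


sum_capacity = symmetric_mi(adder_mac, 2, 2, {0, 1})  # I[{1,2}](P)
single_user = symmetric_mi(adder_mac, 2, 2, {0})      # I[{1}](P)
```

For this particular channel the sum constraint (1.5) is smaller than the sum of the two single-user constraints (1 + 1), so the dominant face is a proper segment rather than a single corner point.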

3 Polarization process 

Definition 5. Let $P : \mathbb{F}_q^m \to \mathcal{Y}$ be a discrete m-user MAC. We define the two channels
$P^- : \mathbb{F}_q^m \to \mathcal{Y}^2$ and $P^+ : \mathbb{F}_q^m \to \mathcal{Y}^2 \times \mathbb{F}_q^m$ as:

$$P^-(y_1, y_2 | u_1^1, \ldots, u_m^1) = \sum_{(u_1^2, \ldots, u_m^2) \in \mathbb{F}_q^m} \frac{1}{q^m} P(y_1 | u_1^1 + u_1^2, \ldots, u_m^1 + u_m^2) P(y_2 | u_1^2, \ldots, u_m^2)$$

$$P^+(y_1, y_2, u_1^1, \ldots, u_m^1 | u_1^2, \ldots, u_m^2) = \frac{1}{q^m} P(y_1 | u_1^1 + u_1^2, \ldots, u_m^1 + u_m^2) P(y_2 | u_1^2, \ldots, u_m^2)$$

$P^-$ and $P^+$ can be constructed from two independent copies of P as follows: the $k$-th user
chooses independently and uniformly two symbols $U_k^1$ and $U_k^2$ in $\mathbb{F}_q$, calculates $X_k^1 = U_k^1 + U_k^2$
and $X_k^2 = U_k^2$, and finally sends $X_k^1$ through the first copy of P and $X_k^2$ through the second
copy of P. $P^-$ and $P^+$ are the channels $U_1^1 \ldots U_m^1 \to Y_1 Y_2$ and $U_1^2 \ldots U_m^2 \to Y_1 Y_2 U_1^1 \ldots U_m^1$ respectively,
where $Y_1$ and $Y_2$ are the respective outputs of the first and second copy of P.

Note that the transformation $(U_1^1, \ldots, U_m^1, U_1^2, \ldots, U_m^2) \to (X_1^1, \ldots, X_m^1, X_1^2, \ldots, X_m^2)$ is bijective
and therefore it induces uniform and independent distributions for $X_1^1, \ldots, X_m^1, X_1^2, \ldots, X_m^2$, which
are the inputs of the P channels.

Definition 6. Let $\{B_n\}_{n \ge 1}$ be i.i.d. uniform random variables on $\{-, +\}$. We define the
MAC-valued process $\{P_n\}_{n \ge 0}$ by:

$$P_0 := P$$

$$P_n := P_{n-1}^{B_n} \quad \forall n \ge 1$$

Proposition 1. The process $\{I[S](P_n)\}_{n \ge 0}$ is a bounded super-martingale for all $S \subset \{1, \ldots, m\}$.
Moreover, it is a bounded martingale if $S = \{1, \ldots, m\}$.

Proof.

$$2 I[S](P) = I[S](P) + I[S](P) = I(X^1(S); Y_1 X^1(S^c)) + I(X^2(S); Y_2 X^2(S^c))$$
$$= I(X^1(S) X^2(S); Y_1 Y_2 X^1(S^c) X^2(S^c)) = I(U^1(S) U^2(S); Y_1 Y_2 U^1(S^c) U^2(S^c))$$
$$= I(U^1(S); Y_1 Y_2 U^1(S^c) U^2(S^c)) + I(U^2(S); Y_1 Y_2 U^1(S^c) U^2(S^c) U^1(S))$$
$$\ge I(U^1(S); Y_1 Y_2 U^1(S^c)) + I(U^2(S); Y_1 Y_2 U_1^1 \ldots U_m^1 U^2(S^c))$$
$$= I[S](P^-) + I[S](P^+)$$

Thus, $E(I[S](P_{n+1}) | P_n) = \frac{1}{2} I[S](P_n^-) + \frac{1}{2} I[S](P_n^+) \le I[S](P_n)$, and $I[S](P_n) \le |S|$ for all $S \subset
\{1, \ldots, m\}$, which proves that $\{I[S](P_n)\}_{n \ge 0}$ is a bounded super-martingale. If $S = \{1, \ldots, m\}$,
the inequality becomes an equality, and $\{I[S](P_n)\}_{n \ge 0}$ is a bounded martingale. □

Since $\frac{1}{2}(I[S](P^-) + I[S](P^+)) \le I[S](P)$ for all $S \subset \{1, \ldots, m\}$, we have $\frac{1}{2}\mathcal{J}(P^-) + \frac{1}{2}\mathcal{J}(P^+) \subset \mathcal{J}(P)$,
and this subset relation can be strict if one of the inequalities is strict for a certain $S \subset \{1, \ldots, m\}$.
Nevertheless, for $S = \{1, \ldots, m\}$, we have $\frac{1}{2}(I(P^-) + I(P^+)) = I(P)$, so at least one point of
the dominant face of $\mathcal{J}(P)$ is present in $\frac{1}{2}\mathcal{J}(P^-) + \frac{1}{2}\mathcal{J}(P^+)$ since the capacity region is a
polymatroid. Therefore, the sum capacity is preserved.
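
The conservation of the sum capacity under one polarization step can be checked numerically on a toy channel. The sketch below builds $P^-$ and $P^+$ from the definition and compares $\frac{1}{2}(I(P^-) + I(P^+))$ with $I(P)$. The helper names (`sum_mi`, `minus`, `plus`) and the example channel `mac` (a two-user ternary adder whose output is erased with probability 0.3) are hypothetical choices made for this illustration only.

```python
from collections import defaultdict
from itertools import product
from math import log


def sum_mi(channel, q, m):
    """I(U; Y) in base-q units for a uniform input vector U in F_q^m."""
    p_u = 1.0 / q ** m
    joint = defaultdict(float)
    for u in product(range(q), repeat=m):
        for y, p in channel(u).items():
            joint[(u, y)] += p_u * p
    p_y = defaultdict(float)
    for (u, y), p in joint.items():
        p_y[y] += p
    return sum(p * log(p / (p_u * p_y[y]), q)
               for (u, y), p in joint.items() if p > 0)


def minus(channel, q, m):
    """P^-: input u1, output (y1, y2); u2 is averaged out uniformly."""
    def p_minus(u1):
        out = defaultdict(float)
        for u2 in product(range(q), repeat=m):
            x1 = tuple((a + b) % q for a, b in zip(u1, u2))
            for y1, p1 in channel(x1).items():
                for y2, p2 in channel(u2).items():
                    out[(y1, y2)] += p1 * p2 / q ** m
        return dict(out)
    return p_minus


def plus(channel, q, m):
    """P^+: input u2, output (y1, y2, u1) with u1 uniform."""
    def p_plus(u2):
        out = defaultdict(float)
        for u1 in product(range(q), repeat=m):
            x1 = tuple((a + b) % q for a, b in zip(u1, u2))
            for y1, p1 in channel(x1).items():
                for y2, p2 in channel(u2).items():
                    out[(y1, y2, u1)] += p1 * p2 / q ** m
        return dict(out)
    return p_plus


def mac(x):
    """Illustrative channel: integer sum of the inputs, erased with probability 0.3."""
    return {sum(x): 0.7, "?": 0.3}


q, m = 3, 2
i_p = sum_mi(mac, q, m)
i_minus = sum_mi(minus(mac, q, m), q, m)
i_plus = sum_mi(plus(mac, q, m), q, m)
```

Only the case $S = \{1, \ldots, m\}$ is checked here; for proper subsets S the same experiment would generally exhibit an inequality rather than an equality.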

From the bounded super-martingale convergence theorem, we deduce that the sequences
$\{I[S](P_n)\}_{n \ge 0}$ converge almost surely for all $S \subset \{1, \ldots, m\}$.

The main result of this section is that, almost surely, $P_n$ becomes an almost deterministic
linear channel:

Theorem 2. Let P be an m-user MAC. Then for every $\epsilon > 0$, we have:

$$\lim_{l \to \infty} \frac{1}{2^l} \left| \{ s \in \{-,+\}^l : \exists A_s \in \mathbb{F}_q^{m \times r_s}, \ \mathrm{rank}(A_s) = r_s, \ |I(A_s^T U_s; Y_s) - I(P^s)| < \epsilon, \ |r_s - I(P^s)| < \epsilon \} \right| = 1$$

$\mathbb{F}_q^{m \times r_s}$ denotes the set of $m \times r_s$ matrices with coefficients in $\mathbb{F}_q$, $U_s$ (a uniform random vector in
$\mathbb{F}_q^m$) is the input to the channel $P^s$, and $Y_s$ is its output.
To prove this theorem, we need several lemmas and definitions:

Lemma 1. ([5]) For any $\epsilon > 0$, there exists $\delta > 0$ such that for any single user channel P we
have:

$$I(P^+) - I(P) < \delta \ \Rightarrow \ I(P) \notin (\epsilon, 1 - \epsilon)$$

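Definition 7. Let P be an m-user MAC, and let $A \in \mathbb{F}_q^{m \times n_1}$, $B \in \mathbb{F}_q^{m \times n_2}$ be two matrices
such that $[A \ B] \in \mathbb{F}_q^{m \times (n_1 + n_2)}$ has rank $n_1 + n_2$ (if $n_2 = 0$, we put $B = \Phi$). We define the
$n_1$-user channel $P[A|B] : \mathbb{F}_q^{n_1} \to \mathcal{Y} \times \mathbb{F}_q^{n_2}$ as follows:

$$P[A|B](y, \vec{v} | \vec{u}) = \frac{1}{q^{m - n_1}} \sum_{\substack{\vec{x} \in \mathbb{F}_q^m \\ A^T \vec{x} = \vec{u}, \ B^T \vec{x} = \vec{v}}} P(y | \vec{x})$$

If $n_2 = 0$, there is no additional $\vec{v}$ at the output, and the $B^T \vec{x} = \vec{v}$ constraint under the sum is
removed.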

Remark 1. The channel $P[A|B]$ can be constructed from P as follows: let $\vec{x} \in \mathbb{F}_q^m$ be the input
to the channel P; $\vec{u} = A^T \vec{x}$ is determined by the $n_1$ users of $P[A|B]$. Since $n_1$ may be less than
m, $\vec{x}$ cannot always be determined from $\vec{u}$: $\vec{u}$ only determines a part of $\vec{x}$, and the other part
is determined by the channel $P[A|B]$ uniformly and independently from $\vec{u}$:

Let $\tilde{A}$ be an $m \times (m - n_1)$ matrix such that $[A \ \tilde{A}]$ is invertible, i.e. the columns of A together
with those of $\tilde{A}$ form a basis for $\mathbb{F}_q^m$. Since $[A \ \tilde{A}]$ is invertible, we could determine $\vec{x}$ if we knew
$[A \ \tilde{A}]^T \vec{x}$. But $[A \ \tilde{A}]^T \vec{x} = [(A^T \vec{x})^T \ (\tilde{A}^T \vec{x})^T]^T = [\vec{u}^T \ \tilde{u}^T]^T$. We only have $\vec{u}$ and we need $\tilde{u}$ to
fully determine $\vec{x}$. The channel $P[A|B]$ generates $\tilde{u}$ uniformly and independently from $\vec{u}$. The
output of the channel $P[A|B]$ is y (the output of P) together with $B^T \vec{x}$. In other words, in
$P[A|B]$, we try to determine $A^T \vec{x}$ when y (the output of P) and $B^T \vec{x}$ are given.

In summary, $P[A|B]$ corresponds to the channel $U = A^T X \to (Y, B^T X)$, where U is a
uniform random vector in $\mathbb{F}_q^{n_1}$, $X = ([A \ \tilde{A}]^T)^{-1} [U^T \ \tilde{U}^T]^T$ and Y is the output of the channel
P when X is the input. $\tilde{A}$ is any $m \times (m - n_1)$ matrix such that $[A \ \tilde{A}]$ is invertible, and $\tilde{U}$ is a
uniform random vector in $\mathbb{F}_q^{m - n_1}$, independent from U. We have $I(P[A|B]) = I(U; Y, B^T X) =
I(A^T X; Y, B^T X)$.

Let $A = [A' \ A'']$ and B be two matrices having m rows such that $[A \ B]$ is full rank. If X
and Y are as above, then we have:

$$I(P[A|B]) = I(A^T X; Y, B^T X) = I(A'^T X, A''^T X; Y, B^T X)$$
$$= I(A'^T X; Y, B^T X) + I(A''^T X; Y, B^T X, A'^T X)$$
$$= I(A'^T X; Y, B^T X) + I(A''^T X; Y, [B \ A']^T X)$$
$$= I(P[A'|B]) + I(P[A''|[B \ A']])$$



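Lemma 2. Let $A \in \mathbb{F}_q^{m \times n_1}$, $B \in \mathbb{F}_q^{m \times n_2}$, $A' \in \mathbb{F}_q^{n_1 \times n_1'}$, $B' \in \mathbb{F}_q^{n_1 \times n_2'}$ be four matrices such
that $[A \ B] \in \mathbb{F}_q^{m \times (n_1 + n_2)}$ has rank $n_1 + n_2$, and $[A' \ B'] \in \mathbb{F}_q^{n_1 \times (n_1' + n_2')}$ has rank $n_1' + n_2'$. Then
$P[A|B][A'|B']$ is equivalent to $P[AA'|[B \ AB']]$.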
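Proof.

$$P[A|B][A'|B'](y, \vec{b}, \vec{b}' | \vec{a}') = \frac{1}{q^{n_1 - n_1'}} \sum_{\substack{\vec{a} \in \mathbb{F}_q^{n_1} \\ A'^T \vec{a} = \vec{a}', \ B'^T \vec{a} = \vec{b}'}} \frac{1}{q^{m - n_1}} \sum_{\substack{\vec{x} \in \mathbb{F}_q^m \\ A^T \vec{x} = \vec{a}, \ B^T \vec{x} = \vec{b}}} P(y | \vec{x})$$
$$= \frac{1}{q^{m - n_1'}} \sum_{\substack{\vec{x} \in \mathbb{F}_q^m \\ (AA')^T \vec{x} = \vec{a}' \\ B^T \vec{x} = \vec{b}, \ (AB')^T \vec{x} = \vec{b}'}} P(y | \vec{x})$$
$$= P[AA'|[B \ AB']](y, [\vec{b}^T \ \vec{b}'^T]^T | \vec{a}')$$

□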



Lemma 3. $P[A|B]^+$ is degraded with respect to $P^+[A|B]$. $P^-[A|B]$ is degraded with respect to
$P[A|B]^-$, and if $B = \Phi$, they are equivalent.

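Proof.

$$P^+[A|B](y_1, y_2, \vec{u}_1, \vec{b} | \vec{a}) = \frac{1}{q^{m - n_1}} \sum_{\substack{\vec{u}_2 \in \mathbb{F}_q^m \\ A^T \vec{u}_2 = \vec{a}, \ B^T \vec{u}_2 = \vec{b}}} P^+(y_1, y_2, \vec{u}_1 | \vec{u}_2)$$
$$= \frac{1}{q^{2m - n_1}} \sum_{\substack{\vec{u}_2 \in \mathbb{F}_q^m \\ A^T \vec{u}_2 = \vec{a}, \ B^T \vec{u}_2 = \vec{b}}} P(y_1 | \vec{u}_1 + \vec{u}_2) P(y_2 | \vec{u}_2)$$

$$P[A|B]^+(y_1, \vec{b}_1, y_2, \vec{b}_2, \vec{a}_1 | \vec{a}_2) = \frac{1}{q^{n_1}} P[A|B](y_1, \vec{b}_1 | \vec{a}_1 + \vec{a}_2) P[A|B](y_2, \vec{b}_2 | \vec{a}_2)$$
$$= \frac{1}{q^{2m - n_1}} \sum_{\substack{\vec{u}_1 \in \mathbb{F}_q^m \\ A^T \vec{u}_1 = \vec{a}_1 + \vec{a}_2 \\ B^T \vec{u}_1 = \vec{b}_1}} \sum_{\substack{\vec{u}_2 \in \mathbb{F}_q^m \\ A^T \vec{u}_2 = \vec{a}_2 \\ B^T \vec{u}_2 = \vec{b}_2}} P(y_1 | \vec{u}_1) P(y_2 | \vec{u}_2)$$
$$= \frac{1}{q^{2m - n_1}} \sum_{\substack{\vec{u}_1 \in \mathbb{F}_q^m \\ A^T \vec{u}_1 = \vec{a}_1 \\ B^T \vec{u}_1 = \vec{b}_1 - \vec{b}_2}} \sum_{\substack{\vec{u}_2 \in \mathbb{F}_q^m \\ A^T \vec{u}_2 = \vec{a}_2 \\ B^T \vec{u}_2 = \vec{b}_2}} P(y_1 | \vec{u}_1 + \vec{u}_2) P(y_2 | \vec{u}_2)$$
$$= \sum_{\substack{\vec{u}_1 \in \mathbb{F}_q^m \\ A^T \vec{u}_1 = \vec{a}_1 \\ B^T \vec{u}_1 = \vec{b}_1 - \vec{b}_2}} P^+[A|B](y_1, y_2, \vec{u}_1, \vec{b}_2 | \vec{a}_2)$$

Which proves that $P[A|B]^+$ is degraded with respect to $P^+[A|B]$.

$$P[A|B]^-(y_1, \vec{b}_1, y_2, \vec{b}_2 | \vec{a}_1) = \frac{1}{q^{n_1}} \sum_{\vec{a}_2 \in \mathbb{F}_q^{n_1}} P[A|B](y_1, \vec{b}_1 | \vec{a}_1 + \vec{a}_2) P[A|B](y_2, \vec{b}_2 | \vec{a}_2)$$
$$= \frac{1}{q^{2m - n_1}} \sum_{\vec{a}_2 \in \mathbb{F}_q^{n_1}} \sum_{\substack{\vec{u}_1 \in \mathbb{F}_q^m \\ A^T \vec{u}_1 = \vec{a}_1 + \vec{a}_2 \\ B^T \vec{u}_1 = \vec{b}_1}} \sum_{\substack{\vec{u}_2 \in \mathbb{F}_q^m \\ A^T \vec{u}_2 = \vec{a}_2 \\ B^T \vec{u}_2 = \vec{b}_2}} P(y_1 | \vec{u}_1) P(y_2 | \vec{u}_2)$$
$$= \frac{1}{q^{2m - n_1}} \sum_{\vec{a}_2 \in \mathbb{F}_q^{n_1}} \sum_{\substack{\vec{u}_1 \in \mathbb{F}_q^m \\ A^T \vec{u}_1 = \vec{a}_1 \\ B^T \vec{u}_1 = \vec{b}_1 - \vec{b}_2}} \sum_{\substack{\vec{u}_2 \in \mathbb{F}_q^m \\ A^T \vec{u}_2 = \vec{a}_2 \\ B^T \vec{u}_2 = \vec{b}_2}} P(y_1 | \vec{u}_1 + \vec{u}_2) P(y_2 | \vec{u}_2)$$
$$= \frac{1}{q^{2m - n_1}} \sum_{\substack{\vec{u}_1 \in \mathbb{F}_q^m \\ A^T \vec{u}_1 = \vec{a}_1 \\ B^T \vec{u}_1 = \vec{b}_1 - \vec{b}_2}} \sum_{\substack{\vec{u}_2 \in \mathbb{F}_q^m \\ B^T \vec{u}_2 = \vec{b}_2}} P(y_1 | \vec{u}_1 + \vec{u}_2) P(y_2 | \vec{u}_2)$$

$$P^-[A|B](y_1, y_2, \vec{b} | \vec{a}) = \frac{1}{q^{m - n_1}} \sum_{\substack{\vec{u}_1 \in \mathbb{F}_q^m \\ A^T \vec{u}_1 = \vec{a}, \ B^T \vec{u}_1 = \vec{b}}} P^-(y_1, y_2 | \vec{u}_1)$$
$$= \frac{1}{q^{2m - n_1}} \sum_{\substack{\vec{u}_1 \in \mathbb{F}_q^m \\ A^T \vec{u}_1 = \vec{a}, \ B^T \vec{u}_1 = \vec{b}}} \sum_{\vec{u}_2 \in \mathbb{F}_q^m} P(y_1 | \vec{u}_1 + \vec{u}_2) P(y_2 | \vec{u}_2)$$
$$= \sum_{\vec{b}_2 \in \mathbb{F}_q^{n_2}} P[A|B]^-(y_1, \vec{b} + \vec{b}_2, y_2, \vec{b}_2 | \vec{a})$$

Which proves that $P^-[A|B]$ is degraded with respect to $P[A|B]^-$. Note that if $B = \Phi$,
all the $B^T \vec{u}$ constraints are eliminated, and there are no more $\vec{b}$ vectors. In this case we have
$P[A|\Phi]^-(y_1, y_2 | \vec{a}_1) = P^-[A|\Phi](y_1, y_2 | \vec{a}_1)$. Therefore, $P[A|\Phi]^-$ and $P^-[A|\Phi]$ are equivalent. □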

Proposition 2. The process $I(P_n[A|B])_{n \ge 0}$ converges almost surely for any two full rank matrices
A and B (B can be $\Phi$) whose columns are linearly independent. Moreover, the limit takes
its value in the set of integers.

Proof. Let us start with the case $B = \Phi$. $P_n^-[A|\Phi]$ is equivalent to $P_n[A|\Phi]^-$, so
$I(P_n^-[A|\Phi]) = I(P_n[A|\Phi]^-)$, and $P_n[A|\Phi]^+$ is degraded with respect to $P_n^+[A|\Phi]$, so $I(P_n^+[A|\Phi]) \ge
I(P_n[A|\Phi]^+)$.

Let $U_1'$, $U_2'$ be two independent uniform random vectors in $\mathbb{F}_q^{n_1}$ which will be the input to
$P_n[A|\Phi]^-$ and $P_n[A|\Phi]^+$ respectively. $U_1 = U_1' + U_2'$ and $U_2 = U_2'$ will be the inputs to two
independent copies of $P_n$ respectively. Let $\tilde{U}_1$ and $\tilde{U}_2$ be two independent uniform random
vectors in $\mathbb{F}_q^{m - n_1}$ which are independent from $U_1$ and $U_2$, let $X_i = ([A \ \tilde{A}]^T)^{-1} [U_i^T \ \tilde{U}_i^T]^T$ for
$i \in \{1, 2\}$ (see remark 1), and let $Y_1$ (resp. $Y_2$) be the output of the channel $P_n$ when $X_1$
(resp. $X_2$) is the input. $\tilde{A}$ is any $m \times (m - n_1)$ matrix such that $[A \ \tilde{A}]$ is invertible. We have:
$I(P_n[A|\Phi]^-) = I(U_1'; Y_1 Y_2)$, $I(P_n[A|\Phi]^+) = I(U_2'; Y_1 Y_2 U_1')$, $U_k = A^T X_k$ and $I(A^T X_k; Y_k) =
I(P_n[A|\Phi])$ for $k \in \{1, 2\}$ (see remark 1). Thus:

$$I(P_n^-[A|\Phi]) + I(P_n^+[A|\Phi]) \ge I(P_n[A|\Phi]^-) + I(P_n[A|\Phi]^+) = I(U_1'; Y_1 Y_2) + I(U_2'; Y_1 Y_2 U_1')$$
$$= I(U_1' U_2'; Y_1 Y_2) = I(U_1 U_2; Y_1 Y_2) = I(A^T X_1, A^T X_2; Y_1 Y_2)$$
$$= I(A^T X_1; Y_1) + I(A^T X_2; Y_2) = 2 I(P_n[A|\Phi])$$

Therefore $E(I(P_{n+1}[A|\Phi]) | P_n) = \frac{1}{2} I(P_n^-[A|\Phi]) + \frac{1}{2} I(P_n^+[A|\Phi]) \ge I(P_n[A|\Phi])$. Moreover
we have $I(P_n[A|\Phi]) \le n_1$, so the process $I(P_n[A|\Phi])_{n \ge 0}$ is a bounded sub-martingale. By
the bounded sub-martingale convergence theorem we deduce the almost sure convergence of
$I(P_n[A|\Phi])_{n \ge 0}$ for any full rank matrix A.

From remark 1 we have $I(P_n[A|B]) = I(P_n[[A \ B]|\Phi]) - I(P_n[B|\Phi])$, so $I(P_n[A|B])_{n \ge 0}$ also
converges almost surely for any two matrices A and B such that $[A \ B]$ is full rank.

Now suppose that $A = \vec{a}$ is a column vector (i.e. $n_1 = 1$). The almost sure convergence
of $I(P_n[\vec{a}|B])_{n \ge 0}$ implies the almost sure convergence of $|I(P_{n+1}[\vec{a}|B]) - I(P_n[\vec{a}|B])|$ to zero,
and so $E(|I(P_{n+1}[\vec{a}|B]) - I(P_n[\vec{a}|B])| \ | P_n)$ converges almost surely to zero, since $I(P_n[\vec{a}|B])$
is bounded. We have also:

$$E(|I(P_{n+1}[\vec{a}|B]) - I(P_n[\vec{a}|B])| \ | P_n) \ge \frac{1}{2} \left( I(P_n^+[\vec{a}|B]) - I(P_n[\vec{a}|B]) \right)$$
$$\ge \frac{1}{2} \left( I(P_n[\vec{a}|B]^+) - I(P_n[\vec{a}|B]) \right)$$

The first inequality comes from the expression of the expectation and the second one comes
from the fact that $P_n[\vec{a}|B]^+$ is degraded with respect to $P_n^+[\vec{a}|B]$ (lemma 3). We conclude
that $I(P_n[\vec{a}|B]^+) - I(P_n[\vec{a}|B])$ converges almost surely to zero.

Let $\{P_n\}_{n \ge 0}$ be a realization in which $I(P_n[\vec{a}|B])_{n \ge 0}$ converges to a certain value $I \in [0, 1]$
(we have $0 \le I(P_n[\vec{a}|B]) \le 1$). Due to the convergence of $I(P_n[\vec{a}|B]^+) - I(P_n[\vec{a}|B])$ to zero
and of $I(P_n[\vec{a}|B])$ to I, for every $\epsilon > 0$, there exists $n_0 > 0$ such that for any $n > n_0$ we have
$|I(P_n[\vec{a}|B]^+) - I(P_n[\vec{a}|B])| < \delta$ where $\delta$ is as in lemma 1, and $|I(P_n[\vec{a}|B]) - I| < \epsilon$. We conclude
from lemma 1 that $I(P_n[\vec{a}|B]) \notin (\epsilon, 1 - \epsilon)$, and since $|I(P_n[\vec{a}|B]) - I| < \epsilon$ then $I \notin (2\epsilon, 1 - 2\epsilon)$,
and this is true for any $\epsilon > 0$. We conclude that $I \in \{0, 1\}$.

Now let $A = [\vec{a}_1 \ldots \vec{a}_{n_1}]$ be any full rank matrix, then by remark 1 we have:

$$I(P_n[A|B]) = \sum_{k=1}^{n_1} I(P_n[\vec{a}_k | [B \ \vec{a}_1 \ldots \vec{a}_{k-1}]])$$

Since each of $I(P_n[\vec{a}_k | [B \ \vec{a}_1 \ldots \vec{a}_{k-1}]])$ converges almost surely to a value in $\{0, 1\}$, then
$I(P_n[A|B])$ converges almost surely to an integer. □

Corollary 1. The limit of the process $I[S](P_n)$ is almost surely an integer for all $S \subset \{1, \ldots, m\}$.

Proof. If we take $A_S = [\vec{e}_k, k \in S]$ and $B_S = [\vec{e}_k, k \in S^c]$, where $\{\vec{e}_k\}$ is the canonical basis
of $\mathbb{F}_q^m$, then $I[S](P_n) = I(P_n[A_S|B_S])$. The assertion about the limit comes from the previous
proposition. □

Lemma 4. (lemma 33^) Let X, W be two independent and uniformly distributed random 
variables in $\mathbb{F}_q$, and let Y be an arbitrary random variable. For every $\epsilon' > 0$, there exists $\delta > 0$
such that:

• $I(X; Y) < \delta$, $I(W; Y) < \delta$, $H(X|YW) < \delta$, $H(W|YX) < \delta$ and

• $H(\beta X + \gamma W | Y) \notin (\delta, 1 - \delta)$ for all $\beta, \gamma \in \mathbb{F}_q$,

implies

$$I(\beta' X + \gamma' W; Y) > 1 - \epsilon'$$

for some $\beta', \gamma' \in \mathbb{F}_q$.

Lemma 5. For every $m > 0$, there exists $\epsilon_m > 0$ such that for any $0 < \epsilon < \epsilon_m$, if P is an m-user
MAC satisfying $d(I(P[A|B]), \mathbb{Z}) < \epsilon$ for all matrices A and B (B can be $\Phi$), then $I(P) > 1 - \epsilon$
implies the existence of a non-zero vector $\vec{a} \in \mathbb{F}_q^m$ satisfying $I(P[\vec{a}|\Phi]) > 1 - \epsilon$.

Proof. Choose 5 as in the above lemma for e' = | , and then choose = min{(5, ^ , } (note 
that our choice of $\epsilon_m$ is non-increasing with m). Let $\{\vec{e}_k, 1 \le k \le m\}$ be the canonical basis of
$\mathbb{F}_q^m$, let $0 < \epsilon < \epsilon_m$ and let P be an m-user MAC satisfying $d(I(P[A|B]), \mathbb{Z}) < \epsilon$ for all matrices
A and B.

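We will prove the lemma by induction on m. If m = 1, we set $\vec{a} = [1]$, so $I(P[\vec{a}|\Phi]) =
I(P) > 1 - \epsilon$. For m > 1, from remark 1 we have:

$$1 - \epsilon < I(P) = I(P[[\vec{e}_1 \ldots \vec{e}_m]|\Phi]) = \sum_{k=1}^{m} I(P[\vec{e}_k | [\vec{e}_{k+1} \ldots \vec{e}_m]])$$

If $I(P[\vec{e}_k | [\vec{e}_{k+1} \ldots \vec{e}_m]]) < \epsilon$ for all k, we get $1 - \epsilon < m\epsilon \Rightarrow \epsilon > \frac{1}{m+1}$, which is a contradiction,
so there exists at least one k satisfying $I(P[\vec{e}_k | [\vec{e}_{k+1} \ldots \vec{e}_m]]) > 1 - \epsilon$. If k > 1, then
$P[[\vec{e}_k \ldots \vec{e}_m]|\Phi]$ has $m - k + 1 < m$ users, and $I(P[[\vec{e}_k \ldots \vec{e}_m]|\Phi]) \ge I(P[\vec{e}_k | [\vec{e}_{k+1} \ldots \vec{e}_m]]) >
1 - \epsilon$. By induction we get a vector $\vec{a}' \in \mathbb{F}_q^{m-k+1}$ such that $I(P[[\vec{e}_k \ldots \vec{e}_m]|\Phi][\vec{a}'|\Phi]) > 1 - \epsilon$.
Let $\vec{a} = [\vec{e}_k \ldots \vec{e}_m] \vec{a}'$, then by lemma 2 we have $I(P[\vec{a}|\Phi]) = I(P[[\vec{e}_k \ldots \vec{e}_m]|\Phi][\vec{a}'|\Phi]) > 1 - \epsilon$
and we are done.

If k = 1 then $I(P[\vec{e}_1 | [\vec{e}_2 \ldots \vec{e}_m]]) > 1 - \epsilon$, so we have (see remark 1):

$$I(P[[\vec{e}_1 \ldots \vec{e}_{m-1}] | \vec{e}_m]) = I(P[[\vec{e}_2 \ldots \vec{e}_{m-1}] | \vec{e}_m]) + I(P[\vec{e}_1 | [\vec{e}_2 \ldots \vec{e}_m]])$$
$$\ge I(P[\vec{e}_1 | [\vec{e}_2 \ldots \vec{e}_m]]) > 1 - \epsilon$$

$P[[\vec{e}_1 \ldots \vec{e}_{m-1}] | \vec{e}_m]$ has m - 1 users. Therefore, by induction we can get a vector $\vec{a}' \in \mathbb{F}_q^{m-1}$
such that $I(P[[\vec{e}_1 \ldots \vec{e}_{m-1}] | \vec{e}_m][\vec{a}'|\Phi]) > 1 - \epsilon$. Let $\vec{a}'' = [\vec{e}_1 \ldots \vec{e}_{m-1}] \vec{a}'$, then we have
$I(P[\vec{a}'' | \vec{e}_m]) = I(P[[\vec{e}_1 \ldots \vec{e}_{m-1}] | \vec{e}_m][\vec{a}'|\Phi]) > 1 - \epsilon$ (see lemma 2). If U and Y are the
input and output of the channel P respectively, then $I(X; YW) = I(P[\vec{a}'' | \vec{e}_m]) > 1 - \epsilon$, where
$X = \vec{a}''^T U$ and $W = \vec{e}_m^T U$ (see remark 1). If $I(X; Y) > 1 - \epsilon$ or $I(W; Y) > 1 - \epsilon$, we set $\vec{a} = \vec{a}''$
or $\vec{a} = \vec{e}_m$ respectively and we are done.

If $I(X; Y) < \epsilon$ and $I(W; Y) < \epsilon$, we have $I(X; YW) > 1 - \epsilon$, so $H(X|YW) < \epsilon$. $I(XW; Y) =
I(X; Y) + I(W; YX) = I(W; Y) + I(X; YW)$, which implies $I(W; YX) \ge I(X; YW) - I(X; Y) >
1 - 2\epsilon > \frac{1}{3} > \epsilon$, thus $I(W; YX) > 1 - \epsilon$ and $H(W|YX) < \epsilon$. Moreover, from the hypothesis we
have $I(\beta X + \gamma W; Y) = I(P[\beta \vec{a}'' + \gamma \vec{e}_m | \Phi]) \notin (\epsilon, 1 - \epsilon)$, so $H(\beta X + \gamma W | Y) \notin (\epsilon, 1 - \epsilon)$ for all
$\beta, \gamma \in \mathbb{F}_q$.

Notice that $\epsilon < \epsilon_m \le \delta$. Therefore, by the above lemma there exist $\beta', \gamma' \in \mathbb{F}_q$ such that
$I(\beta' X + \gamma' W; Y) > 1 - \epsilon' > \epsilon$, which implies $I(P[\vec{a}|\Phi]) = I(\beta' X + \gamma' W; Y) > 1 - \epsilon$ for
$\vec{a} = \beta' \vec{a}'' + \gamma' \vec{e}_m$.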

□ 
Proposition 3. For any $\epsilon < \epsilon_m$, if $P : U_1 \ldots U_m \to Y$ is an m-user MAC that satisfies
$d(I(P[A|B]), \mathbb{Z}) < \epsilon$ for all matrices A and B (B can be $\Phi$), then there exists a matrix $A_P$
of rank r such that $|I(A_P^T U; Y) - I(P)| < 2\epsilon$, $|I(A_P^T U; Y) - r| < \epsilon$ and $|I(P) - r| < \epsilon$.

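Proof. Let $V = \{\vec{a} \in \mathbb{F}_q^m : \vec{a} = \vec{0} \text{ or } I(\vec{a}^T U; Y) > 1 - \epsilon\}$, then V is a subspace of $\mathbb{F}_q^m$:

Let $\vec{a} \in V$ and $\beta \in \mathbb{F}_q$. If $\vec{a} = \vec{0}$ or $\beta = 0$ then $\beta \vec{a} = \vec{0} \in V$. If $\vec{a} \neq \vec{0}$ and $\beta \neq 0$, then
$I(\beta \vec{a}^T U; Y) = I(\vec{a}^T U; Y) > 1 - \epsilon$ and $\beta \vec{a} \in V$.

Suppose m > 1, let $\vec{a}_1, \vec{a}_2 \in V$ and $\beta_1, \beta_2 \in \mathbb{F}_q$. We can suppose that $\vec{a}_1, \vec{a}_2 \neq \vec{0}$ and
$\beta_1, \beta_2 \neq 0$ because otherwise we would be in the previous case. Let $U' = \vec{a}_1^T U$, $U'' = \vec{a}_2^T U$,
$X' = \beta_1 U' + \beta_2 U''$ and $X'' = \beta_1 U' + (\beta_2 + 1) U''$, then the transformation $(U', U'') \to (X', X'')$
is invertible. Thus:

$$I(X'; Y) + I(X''; YX') = I(X'X''; Y) = I(U'U''; Y) = I(U'; Y) + I(U''; YU')$$
$$\ge I(U'; Y) + I(U''; Y) > 2 - 2\epsilon$$

If $I(X'; Y) < \epsilon$ then $I(X''; YX') > 2 - 3\epsilon > 2 - \frac{3}{3} = 1$, which is a contradiction (remember
that $\epsilon < \epsilon_m \le \frac{1}{3}$). So $I((\beta_1 \vec{a}_1 + \beta_2 \vec{a}_2)^T U; Y) = I(X'; Y) > 1 - \epsilon$, and $\beta_1 \vec{a}_1 + \beta_2 \vec{a}_2 \in V$.

Therefore, V is a subspace of $\mathbb{F}_q^m$.

Let $\vec{a}_1, \ldots, \vec{a}_r$ be a basis of V, and let $\vec{a}_{r+1}, \ldots, \vec{a}_m$ be m - r vectors extending $\{\vec{a}_1, \ldots, \vec{a}_r\}$
to a basis of $\mathbb{F}_q^m$. Define the two matrices $A := [\vec{a}_1 \ldots \vec{a}_r]$ and $B := [\vec{a}_{r+1} \ldots \vec{a}_m]$.

If $I(P[B|\Phi]) = I(B^T U; Y) > 1 - \epsilon$, then by considering the channel $P[B|\Phi]$ we get
by lemma 5 a vector $\vec{\beta} \in \mathbb{F}_q^{m-r}$ such that $I(P[B|\Phi][\vec{\beta}|\Phi]) > 1 - \epsilon$, but this means that
$I((B\vec{\beta})^T U; Y) = I(P[B\vec{\beta}|\Phi]) = I(P[B|\Phi][\vec{\beta}|\Phi]) > 1 - \epsilon$ (see lemma 2) and so $B\vec{\beta} \in V$, and
therefore $B\vec{\beta}$ can be written as a linear combination of the vectors of A, which form a basis for
V. But this is a contradiction since the vectors of A and B are linearly independent. Therefore,
we must have $0 \le I(P[B|\Phi]) < \epsilon$.

On the other hand, we have:

$$I(A^T U; Y) = I([\vec{a}_1 \ldots \vec{a}_r]^T U; Y) = \sum_{k=1}^{r} I(\vec{a}_k^T U; Y, [\vec{a}_1 \ldots \vec{a}_{k-1}]^T U) \ge \sum_{k=1}^{r} I(\vec{a}_k^T U; Y) > r - r\epsilon$$

So $r - r\epsilon < I(A^T U; Y) \le r$, and $|I(A^T U; Y) - r| < r\epsilon < \frac{m}{m+1} = 1 - \frac{1}{m+1} < 1 - \epsilon$
(remember that $\epsilon < \epsilon_m \le \frac{1}{m+1}$), but $I(A^T U; Y) = I(P[A|\Phi])$, and from the hypothesis we have
$d(I(P[A|\Phi]), \mathbb{Z}) < \epsilon$, so r is the closest integer to $I(P[A|\Phi])$, thus $r - \epsilon < I(A^T U; Y) \le r$ and
$|I(A^T U; Y) - r| < \epsilon$. Moreover $I(A^T U; Y, B^T U) \le r$ so we get:

$$r - \epsilon < I(A^T U; Y) \le I(A^T U; Y, B^T U) \le I(A^T U; Y, B^T U) + I(B^T U; Y) < r + \epsilon$$

and since the matrix $[A \ B]^T$ is invertible, then we have:

$$I(P) = I(U; Y) = I([A \ B]^T U; Y) = I(B^T U; Y) + I(A^T U; Y, B^T U)$$

Therefore, $r - \epsilon < I(P) < r + \epsilon$ since $r - \epsilon < I(A^T U; Y, B^T U) + I(B^T U; Y) < r + \epsilon$. We conclude
that $|I(P) - r| < \epsilon$, and $|I(A^T U; Y) - I(P)| < 2\epsilon$ because we already have $|I(A^T U; Y) - r| <
\epsilon$. □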

Now we are ready to prove theorem 2: 
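Proof (of theorem 2).

The processes $I(P_n[A|B])$ converge almost surely to integers, and therefore the maximal
distance between $I(P_n[A|B])$ and the set of integers converges almost surely to zero. For large
enough n, this maximal distance becomes less than $\epsilon_m$ and by the previous proposition we
conclude that the channels $P_n$ almost surely become almost deterministic linear channels.

For the sake of accuracy, we provide the following rigorous proof. Let P be an m-user MAC,
and $\epsilon > 0$. Let $\delta = \frac{1}{2} \min\{\epsilon, \epsilon_m\}$. For $n, k \in \mathbb{N}^*$, define the event $T_{n,k}$ as:

$$T_{n,k} = \{ \forall n_1 \in [1, m] \cap \mathbb{N}, \ \forall n_2 \in [0, m - n_1] \cap \mathbb{N}, \ \forall [A \ B] \in \mathbb{F}_q^{m \times (n_1 + n_2)} : \ \mathrm{rank}([A \ B]) = n_1 + n_2 \Rightarrow d(I(P_n[A|B]), \mathbb{Z}) < \tfrac{1}{k} \}$$

Since the processes $I(P_n[A|B])$ converge almost surely to integer values for all matrices A and
B such that $[A \ B]$ is full rank, the event $\bigcap_{k \ge 1} \bigcup_{l \ge 1} \bigcap_{n \ge l} T_{n,k}$ has probability 1. Let
$k > \frac{1}{\delta}$. Then $\Pr\left( \bigcup_{l \ge 1} \bigcap_{n \ge l} T_{n,k} \right) \ge \Pr\left( \bigcap_{k \ge 1} \bigcup_{l \ge 1} \bigcap_{n \ge l} T_{n,k} \right) = 1$. The events $\bigcap_{n \ge l} T_{n,k}$ are increasing with l,
so $\Pr\left( \bigcup_{l \ge 1} \bigcap_{n \ge l} T_{n,k} \right) = \lim_{l \to \infty} \Pr\left( \bigcap_{n \ge l} T_{n,k} \right) = 1$. We conclude that:

$$\lim_{l \to \infty} \Pr(T_{l,k}) \ge \lim_{l \to \infty} \Pr\left( \bigcap_{n \ge l} T_{n,k} \right) = 1$$

The event $T_{l,k}$ implies $d(I(P_l[A|B]), \mathbb{Z}) < \frac{1}{k} < \delta < \epsilon_m$ for all matrices A and B such that
$[A \ B]$ is full rank. Then by proposition 3, there exists a matrix $A_{P_l}$ of rank r such that
$|I(A_{P_l}^T U_{P_l}; Y_{P_l}) - I(P_l)| < 2\delta \le \epsilon$, $|I(A_{P_l}^T U_{P_l}; Y_{P_l}) - r| < \delta < \epsilon$ and $|I(P_l) - r| < \delta < \epsilon$, where
$U_{P_l}$ and $Y_{P_l}$ are the input and output of $P_l$ respectively. We conclude that the event $T_{l,k}$ implies
the event $C_l$ defined by:

$$C_l = \{ \exists A_{P_l} \in \mathbb{F}_q^{m \times r_{P_l}}, \ \mathrm{rank}(A_{P_l}) = r_{P_l}, \ |I(A_{P_l}^T U_{P_l}; Y_{P_l}) - I(P_l)| < \epsilon, \ |r_{P_l} - I(P_l)| < \epsilon \}$$

So $\lim_{l \to \infty} \Pr(C_l) = 1$. By examining the explicit expression of $\Pr(C_l)$ we get:

$$\lim_{l \to \infty} \frac{1}{2^l} \left| \{ s \in \{-,+\}^l : \exists A_s \in \mathbb{F}_q^{m \times r_s}, \ \mathrm{rank}(A_s) = r_s, \ |I(A_s^T U_s; Y_s) - I(P^s)| < \epsilon, \ |r_s - I(P^s)| < \epsilon \} \right| = 1$$

□


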



4 Rate of polarization 

Now we are interested in the rate of polarization of $P_n$ into deterministic linear channels.

Definition 8. The Bhattacharyya parameter of a single user channel Q with input alphabet $\mathcal{X}$
and output alphabet $\mathcal{Y}$ is defined as:

$$Z(Q) = \frac{1}{|\mathcal{X}|(|\mathcal{X}| - 1)} \sum_{\substack{(x, x') \in \mathcal{X}^2 \\ x \neq x'}} \sum_{y \in \mathcal{Y}} \sqrt{Q(y|x) Q(y|x')}$$

It is known that $P_e(Q) \le q Z(Q)$ (see [6]), where $P_e(Q)$ is the probability of error of the
maximum likelihood decoder of Q.
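
To make the definition concrete, here is a small sketch that evaluates the Bhattacharyya parameter of a single user channel given as a table of transition probabilities. The function name `bhattacharyya` and the nested-dictionary representation `Q[x][y]` are assumptions made for this example. The test channel is a ternary erasure channel with erasure probability 0.25, for which only the erasure symbol contributes to the sum, so Z equals the erasure probability.

```python
from itertools import permutations
from math import sqrt


def bhattacharyya(Q, inputs, outputs):
    """Average Bhattacharyya parameter of a single user channel.

    Q[x][y] holds Q(y|x); missing entries are treated as probability 0.
    The sum runs over all ordered pairs (x, x') with x != x'.
    """
    q = len(inputs)
    total = sum(sqrt(Q[x].get(y, 0.0) * Q[xp].get(y, 0.0))
                for x, xp in permutations(inputs, 2)
                for y in outputs)
    return total / (q * (q - 1))


inputs = [0, 1, 2]
outputs = [0, 1, 2, "erased"]

# Ternary erasure channel (illustrative): the symbol is erased with probability 0.25.
erasure = {x: {x: 0.75, "erased": 0.25} for x in inputs}
# Perfect and useless channels, the two extremes of polarization.
perfect = {x: {x: 1.0} for x in inputs}
useless = {x: {"erased": 1.0} for x in inputs}

z_erasure = bhattacharyya(erasure, inputs, outputs)
z_perfect = bhattacharyya(perfect, inputs, outputs)
z_useless = bhattacharyya(useless, inputs, outputs)
```

The perfect channel gives Z = 0 and the useless channel gives Z = 1, which matches the role Z plays in the rate of polarization: Z close to 0 goes together with mutual information close to 1.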

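Lemma 6. Let P be an m-user MAC. For any $\vec{a} \in \mathbb{F}_q^m$ we have:

$$\lim_{l \to \infty} \frac{1}{2^l} \left| \{ s \in \{-,+\}^l : I(P^s[\vec{a}|\Phi]) > 1 - \epsilon, \ Z(P^s[\vec{a}|\Phi]) \ge 2^{-2^{\beta l}} \} \right| = 0$$

for all $0 < \epsilon < 1$, $0 < \beta < \frac{1}{2}$.


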



Proof. $I(P_n[\vec{a}|\Phi])$ converges almost surely to 0 or 1, and this means that $Z(P_n[\vec{a}|\Phi])$ also
converges almost surely to 0 or 1 due to the relations between the quantities I(Q) and Z(Q) (see
proposition 3 of [^). 

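$Z(P^+[\vec{a}|\Phi]) \le Z(P[\vec{a}|\Phi]^+)$ since $P[\vec{a}|\Phi]^+$ is degraded with respect to $P^+[\vec{a}|\Phi]$, and $Z(P^-[\vec{a}|\Phi]) =
Z(P[\vec{a}|\Phi]^-)$ since $P[\vec{a}|\Phi]^-$ and $P^-[\vec{a}|\Phi]$ are equivalent. From [6] we have:

$$Z(P[\vec{a}|\Phi]^-) \le q Z(P[\vec{a}|\Phi]) \quad \text{and} \quad Z(P[\vec{a}|\Phi]^+) = Z(P[\vec{a}|\Phi])^2$$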

Now we can apply theorem i of [3] to get 



$$\lim_{n \to \infty} \Pr\left( I(P_n[\vec{a}|\Phi]) > 1 - \epsilon, \ Z(P_n[\vec{a}|\Phi]) \ge 2^{-2^{\beta n}} \right) = 0$$

By examining the explicit expression of the last probability we get the result. □

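Theorem 3. The convergence of $P_n$ into deterministic linear channels is almost surely fast:

$$\lim_{l \to \infty} \frac{1}{2^l} \left| \{ s \in \{-,+\}^l : \exists A_s = [\vec{a}_1 \ldots \vec{a}_{r_s}] \in \mathbb{F}_q^{m \times r_s}, \ \mathrm{rank}(A_s) = r_s : \ |I(A_s^T U_s; Y_s) - I(P^s)| < \epsilon, \ |r_s - I(P^s)| < \epsilon, \ \sum_{k=1}^{r_s} Z(P^s[\vec{a}_k|\Phi]) < 2^{-2^{\beta l}} \} \right| = 1$$

for all $0 < \epsilon < 1$, $0 < \beta < \frac{1}{2}$. $U_s$ and $Y_s$ are the input and output of $P^s$ respectively.
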



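Proof. Let $\beta < \beta' < \frac{1}{2}$ and define:

$$E_{\vec{a}} = \{ s \in \{-,+\}^l : I(P^s[\vec{a}|\Phi]) > 1 - 2\epsilon, \ Z(P^s[\vec{a}|\Phi]) \ge 2^{-2^{\beta' l}} \} \quad \text{for } \vec{a} \in \mathbb{F}_q^m$$

$$E_1 = \{ s \in \{-,+\}^l : \exists A_s \in \mathbb{F}_q^{m \times r_s}, \ \mathrm{rank}(A_s) = r_s, \ |I(A_s^T U_s; Y_s) - I(P^s)| < \epsilon, \ |r_s - I(P^s)| < \epsilon \}$$

$$E_2 = \{ s \in \{-,+\}^l : \exists A_s = [\vec{a}_1 \ldots \vec{a}_{r_s}] \in \mathbb{F}_q^{m \times r_s}, \ \mathrm{rank}(A_s) = r_s, \ |I(A_s^T U_s; Y_s) - I(P^s)| < \epsilon, \ |r_s - I(P^s)| < \epsilon, \ \sum_{k=1}^{r_s} Z(P^s[\vec{a}_k|\Phi]) < r_s 2^{-2^{\beta' l}} \}$$

If $s \in E_1 \setminus \left( \bigcup_{\vec{a} \in \mathbb{F}_q^m} E_{\vec{a}} \right)$ then $\exists A_s = [\vec{a}_1 \ldots \vec{a}_{r_s}] \in \mathbb{F}_q^{m \times r_s}$ such that $\mathrm{rank}(A_s) = r_s$, $|r_s -
I(P^s)| < \epsilon$ and $|I(A_s^T U_s; Y_s) - I(P^s)| < \epsilon$ (so $|I(A_s^T U_s; Y_s) - r_s| < 2\epsilon$). For $1 \le k \le r_s$ we have:

$$I(A_s^T U_s; Y_s) = I(P^s[A_s|\Phi]) = I(P^s[\vec{a}_k|\Phi]) + I(P^s[[\vec{a}_1 \ldots \vec{a}_{k-1} \vec{a}_{k+1} \ldots \vec{a}_{r_s}] | \vec{a}_k]) > r_s - 2\epsilon \quad \forall k \in \{1, \ldots, r_s\}$$

and since $I(P^s[[\vec{a}_1 \ldots \vec{a}_{k-1} \vec{a}_{k+1} \ldots \vec{a}_{r_s}] | \vec{a}_k]) \le r_s - 1$, then $I(P^s[\vec{a}_k|\Phi]) > 1 - 2\epsilon$, which implies
$Z(P^s[\vec{a}_k|\Phi]) < 2^{-2^{\beta' l}}$ since $s \notin E_{\vec{a}_k}$ for all $k \in \{1, \ldots, r_s\}$. So $\sum_{k=1}^{r_s} Z(P^s[\vec{a}_k|\Phi]) < r_s 2^{-2^{\beta' l}}$, and
therefore $s \in E_2$. Thus $E_1 \setminus \left( \bigcup_{\vec{a} \in \mathbb{F}_q^m} E_{\vec{a}} \right) \subset E_2$ and $|E_2| \ge |E_1| - \sum_{\vec{a} \in \mathbb{F}_q^m} |E_{\vec{a}}|$. By theorem 2 and
lemma 6 we have:

$$1 \ge \lim_{l \to \infty} \frac{1}{2^l} |E_2| \ge \lim_{l \to \infty} \frac{1}{2^l} \left( |E_1| - \sum_{\vec{a} \in \mathbb{F}_q^m} |E_{\vec{a}}| \right) = 1 - 0 = 1$$

By noticing that $r_s 2^{-2^{\beta' l}} \le m 2^{-2^{\beta' l}} < 2^{-2^{\beta l}}$ for l large enough, we conclude the limit in the
theorem. □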

5 Polar codes construction 

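Choose $0 < \epsilon < 1$ and $0 < \beta < \beta' < \frac{1}{2}$, and let l be an integer such that $q 2^l 2^{-2^{\beta' l}} < 2^{-2^{\beta l}}$ and
such that the set $E_l$ defined below covers all but a sufficiently small fraction of $\{-,+\}^l$:

$$E_l = \{ s \in \{-,+\}^l : \exists r_s, \exists A_s = [\vec{a}_1 \ldots \vec{a}_{r_s}] \in \mathbb{F}_q^{m \times r_s}, \ \mathrm{rank}(A_s) = r_s : \ |I(A_s^T U_s; Y_s) - I(P^s)| < \tfrac{\epsilon}{2}, \ |r_s - I(P^s)| < \tfrac{\epsilon}{2}, \ \sum_{k=1}^{r_s} Z(P^s[\vec{a}_k|\Phi]) < 2^{-2^{\beta' l}} \}$$

Such an integer exists due to theorem 3.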

For each $s \in \{-,+\}^l$, if $s \notin E_l$ set $F(s, k) = 1 \ \forall k \in \{1, \ldots, m\}$, and if $s \in E_l$ choose a matrix
$A_s$ of rank $r_s$ which satisfies the conditions in $E_l$, then choose a set of indices $S_s = \{i_1, \ldots, i_{r_s}\}$
such that the corresponding rows of $A_s$ are linearly independent, then set $F(s, k) = 1$ if $k \notin S_s$,
and $F(s, k) = 0$ if $k \in S_s$. $F(s, k) = 1$ indicates that the user k is frozen in the channel $P^s$, i.e.
no useful information is being sent.
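
The index selection step can be sketched as follows: given a matrix $A_s$ over $\mathbb{F}_q$ (q prime), pick row indices whose rows are linearly independent and freeze every other user. The function names `independent_rows` and `frozen_flags` are hypothetical, and the greedy choice (take the first independent rows encountered) is only one of the admissible choices of $S_s$; users are indexed from 0 in the code.

```python
def independent_rows(A, q):
    """Greedily select indices of linearly independent rows of A over F_q.

    A is a list of rows (lists of integers); q must be prime so that every
    non-zero element has a multiplicative inverse.
    """
    basis = []   # list of (pivot column, row normalized to 1 at the pivot)
    chosen = []
    for i, row in enumerate(A):
        v = [x % q for x in row]
        # Reduce the candidate row against the rows already selected.
        for pivot, b in basis:
            if v[pivot]:
                c = v[pivot]
                v = [(x - c * y) % q for x, y in zip(v, b)]
        pivot = next((j for j, x in enumerate(v) if x), None)
        if pivot is not None:  # the row is independent of the selected ones
            inv = pow(v[pivot], -1, q)
            basis.append((pivot, [(x * inv) % q for x in v]))
            chosen.append(i)
    return chosen


def frozen_flags(A, q):
    """Return the list [F(s, k)] for k = 0..m-1: 0 = information, 1 = frozen."""
    info = set(independent_rows(A, q))
    return [0 if k in info else 1 for k in range(len(A))]


# Illustrative 3-user example over F_3 with a rank-2 matrix:
# the second row is twice the first one, so user 1 is frozen.
A_s = [[1, 0],
       [2, 0],
       [0, 1]]
flags = frozen_flags(A_s, 3)
```

Scanning the rows in a different order would select a different index set of the same size, which changes how the sum rate is split among the users but not the sum rate itself.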

A polar code is constructed as follows: the user k sends a symbol $U_{s,k}$ through a channel
equivalent to $P^s$. If $F(s, k) = 0$, $U_{s,k}$ is an information symbol, and if $F(s, k) = 1$, $U_{s,k}$ is a
certain frozen symbol. Since we are free to choose any value for the frozen symbols, we will
analyze the performance of the polar code averaged over all the possible choices of the frozen
symbols, so we will consider that the $U_{s,k}$ are independent random variables, uniformly distributed
in $\mathbb{F}_q$, $\forall s \in \{-,+\}^l$, $\forall k \in \{1, \ldots, m\}$. However, the value of $U_{s,k}$ will be revealed to the receiver
if $F(s, k) = 1$, and if $F(s, k) = 0$ the receiver has to estimate $U_{s,k}$ from the output of the channel.

We associate the set $\{-,+\}^l$ with the strict total order < defined as $s_1 \ldots s_l < s_1' \ldots s_l'$ if and
only if $s_i = -$, $s_i' = +$ for some $i \in \{1, \ldots, l\}$ and $s_h = s_h' \ \forall h > i$.
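
The order compares two sign sequences at the last position where they differ, with − placed before +. A minimal sketch, in which sign sequences are represented as Python strings over the characters `-` and `+` and the names `precedes` and `decoding_order` are hypothetical:

```python
from itertools import product


def precedes(s, t):
    """True if s < t: at the last position where they differ, s has '-' and t has '+'."""
    for a, b in zip(reversed(s), reversed(t)):
        if a != b:
            return a == "-" and b == "+"
    return False  # equal sequences are not strictly ordered


def decoding_order(l):
    """All sign sequences of length l, sorted increasingly for the order above."""
    seqs = ["".join(p) for p in product("-+", repeat=l)]
    # Sort by the reversed sequence, mapping '-' to 0 and '+' to 1.
    return sorted(seqs, key=lambda s: [0 if c == "-" else 1 for c in reversed(s)])


order = decoding_order(2)
```

This is the order in which a successive cancelation decoder would visit the indices s, since the estimate for s relies on the symbols of all s' < s.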
5.1 Encoding

Let $\{P_s\}_{s \in \{-,+\}^l}$ be a set of $2^l$ independent copies of the channel P. Do not confuse $P_s$ with
$P^s$: $P_s$ is a copy of the channel P and $P^s$ is the polarized channel obtained from P as before.

Define $U_{s_1, s_2, k}$ for $s_1 \in \{-,+\}^{l'}$, $s_2 \in \{-,+\}^{l - l'}$, $0 \le l' \le l$ inductively as:

• $U_{\emptyset, s, k} = U_{s,k}$ if $l' = 0$, $s \in \{-,+\}^l$.

• $U_{(s_1;-), s_2, k} = U_{s_1, (s_2;+), k} + U_{s_1, (s_2;-), k}$ if $l' > 0$, $s_1 \in \{-,+\}^{l'-1}$, $s_2 \in \{-,+\}^{l-l'}$.

• $U_{(s_1;+), s_2, k} = U_{s_1, (s_2;+), k}$ if $l' > 0$, $s_1 \in \{-,+\}^{l'-1}$, $s_2 \in \{-,+\}^{l-l'}$.

The user k sends $U_{s, \emptyset, k}$ through the channel $P_s$ for all $s \in \{-,+\}^l$. Let $Y_s$ be the output
of the channel $P_s$, and let $Y = \{Y_s\}_{s \in \{-,+\}^l}$. We can prove by induction on $l'$ that the channel
$U_{s_1, s_2} \to (\{Y_s\}_{s \text{ has } s_1 \text{ as prefix}}, \{U_{s_1, s'}\}_{s' < s_2})$ is equivalent to $P^{s_2}$. In particular, the channel
$U_s \to (Y, \{U_{s'}\}_{s' < s})$ is equivalent to the channel $P^s$.
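
The inductive definition above can be sketched for a single user k as follows. The function name `polar_encode`, the representation of sign sequences as strings, and the reading of $(s;\pm)$ as "s with the sign appended at the end" are assumptions made for this illustration.

```python
from itertools import product


def polar_encode(u, l, q):
    """Apply the recursion U_{(s1;-),s2} = U_{s1,(s2;+)} + U_{s1,(s2;-)},
    U_{(s1;+),s2} = U_{s1,(s2;+)} for one user.

    u maps each sign string s of length l to the symbol U_{s,k} in F_q.
    Returns x with x[s] = U_{s,empty,k}, the symbol sent over the copy P_s.
    """
    # Level l' = 0: U_{empty, s} = U_s.
    level = {("", s): v % q for s, v in u.items()}
    for lp in range(1, l + 1):
        nxt = {}
        for s1 in map("".join, product("-+", repeat=lp - 1)):
            for s2 in map("".join, product("-+", repeat=l - lp)):
                up = level[(s1, s2 + "+")]
                um = level[(s1, s2 + "-")]
                nxt[(s1 + "-", s2)] = (up + um) % q
                nxt[(s1 + "+", s2)] = up
        level = nxt
    # Level l' = l: the second index is empty.
    return {s1: v for (s1, _), v in level.items()}


# One polarization step over F_3: X^1 = U^1 + U^2 and X^2 = U^2.
x = polar_encode({"-": 2, "+": 2}, 1, 3)

# For l = 2 the map is a bijection on F_3^4, so all 81 inputs give distinct outputs.
keys = ["".join(p) for p in product("-+", repeat=2)]
images = {tuple(sorted(polar_encode(dict(zip(keys, vals)), 2, 3).items()))
          for vals in product(range(3), repeat=4)}
```

Each level only combines pairs of symbols, so the whole transform costs on the order of $l \, 2^l$ additions in $\mathbb{F}_q$ per user.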

5.2 Decoding 

If $s \notin E_l$ then $F(s, k) = 1$ for all k, and the receiver knows all $U_{s,k}$: there is nothing to decode.
Suppose that $s \in E_l$. If we know $\{U_{s'}\}_{s' < s}$ then we can estimate $U_s$ as follows:

• If $F(s, k) = 1$ then we know $U_{s,k}$.

• We have $F(s, k) = 0$ for $r_s$ values of k corresponding to $r_s$ linearly independent rows of
$A_s$. So if we know $A_s^T U_s$, we can recover $U_{s,k}$ for $F(s, k) = 0$.

• If $A_s = [\vec{a}_1 \ldots \vec{a}_{r_s}]$, then we can estimate $A_s^T U_s$ by estimating $\vec{a}_h^T U_s$ for $h \in \{1, \ldots, r_s\}$.

• Since $\vec{a}_h^T U_s \to (Y, \{U_{s'}\}_{s' < s})$ is equivalent to $P^s[\vec{a}_h|\Phi]$, we can estimate $\vec{a}_h^T U_s$ using the
maximum likelihood decoder of $P^s[\vec{a}_h|\Phi]$.

• Let $\mathcal{D}_s(Y, \{U_{s'}\}_{s' < s})$ be the estimate of $U_s$ obtained from $(Y, \{U_{s'}\}_{s' < s})$ by the above
procedure.

This motivates the following successive cancelation decoder:

• $\hat{U}_s = U_s$ if $s \notin E_l$.

• $\hat{U}_s = \mathcal{D}_s(Y, \{\hat{U}_{s'}\}_{s' < s})$ if $s \in E_l$.

5.3 Performance of polar codes 

If $s \in E_l$, the probability of error in estimating $\vec{a}_h^T U_s$ using the maximum likelihood decoder
is upper bounded by $q Z(P^s[\vec{a}_h|\Phi])$. So the probability of error in estimating $A_s^T U_s$ is upper
bounded by $\sum_{k=1}^{r_s} q Z(P^s[\vec{a}_k|\Phi]) < q 2^{-2^{\beta' l}}$. Therefore, the probability of error in estimating $U_s$
from $(Y, \{U_{s'}\}_{s' < s})$ is upper bounded by $q 2^{-2^{\beta' l}}$ when $s \in E_l$.

Note that $\mathcal{D}_s(Y, \{U_{s'}\}_{s' < s}) = U_s \ (\forall s \in E_l) \Leftrightarrow \mathcal{D}_s(Y, \{\hat{U}_{s'}\}_{s' < s}) = U_s \ (\forall s \in E_l)$, so the
probability of error of the above successive cancelation decoder is upper bounded by

$$\sum_{s \in E_l} \Pr\left( \mathcal{D}_s(Y, \{U_{s'}\}_{s' < s}) \neq U_s \right) < |E_l| \, q 2^{-2^{\beta' l}} \le q 2^l 2^{-2^{\beta' l}} < 2^{-2^{\beta l}}$$

The above upper bound was calculated on average over a random choice of the frozen symbols.
Therefore, there is at least one choice of the frozen symbols for which the upper bound of the
probability of error still holds.



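The last thing to discuss is the rate vector of polar codes. The rate at which the user k is
communicating is $R_k = \frac{1}{2^l} \sum_{s \in E_l} (1 - F(s, k))$, and the sum rate is:

$$R = \sum_{1 \le k \le m} R_k = \frac{1}{2^l} \sum_{1 \le k \le m} \sum_{s \in E_l} (1 - F(s, k)) = \frac{1}{2^l} \sum_{s \in E_l} r_s$$

We have $|I(P^s) - r_s| < \frac{\epsilon}{2}$ and $I(P^s) < r_s + \frac{\epsilon}{2}$ for all $s \in E_l$. And since we have $\sum_{s \in \{-,+\}^l} I(P^s) =
2^l I(P)$ we conclude:

$$I(P) = \frac{1}{2^l} \sum_{s \in \{-,+\}^l} I(P^s) = \frac{1}{2^l} \sum_{s \in E_l} I(P^s) + \frac{1}{2^l} \sum_{s \in E_l^c} I(P^s) < \frac{1}{2^l} \sum_{s \in E_l} \left( r_s + \frac{\epsilon}{2} \right) + \frac{m}{2^l} |E_l^c|$$

The first term on the right is at most $R + \frac{\epsilon}{2}$, and the second term is small when $E_l$ contains
most of $\{-,+\}^l$.

To this end we have proved the following theorem, which is the main result in this report:

Theorem 4. For every $0 < \epsilon < 1$ and for every $0 < \beta < \frac{1}{2}$, there exists a polar code of length
N having a sum rate $R > I(P) - \epsilon$ and a probability of error $P_e < 2^{-N^{\beta}}$.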

A final note to report is that by changing our choice of the indices in Ss, the rate vector of 
the polar code moves at a distance of at most e along the dominant face of the capacity region 
achievable by polar codes. However, the dominant face of the initial capacity region can be 
strictly bigger than the dominant face achievable by polar codes. 

6 Case study 

In this section, we are interested in studying the problem of loss in the capacity region by
polarization in the special case of channels which are combinations of deterministic linear channels.

Definition 9. An m-user MAC P is said to be a combination of n linear channels if there are
n matrices $A_1, \ldots, A_n$ ($A_k \in \mathbb{F}_q^{m \times m_k}$) such that P is equivalent to the channel $P_l : \mathbb{F}_q \times \cdots \times \mathbb{F}_q \to
\bigcup_{k=1}^{n} \{k\} \times \mathbb{F}_q^{m_k}$ defined by:

$$P_l(k, \vec{y} | \vec{x}) = \begin{cases} p_k & \text{if } A_k^T \vec{x} = \vec{y} \\ 0 & \text{otherwise} \end{cases} \quad \forall k \in \{1, \ldots, n\}, \ \forall \vec{y} \in \mathbb{F}_q^{m_k}, \ \forall \vec{x} \in \mathbb{F}_q^m$$

where $\sum_{k=1}^{n} p_k = 1$ and $p_k \neq 0 \ \forall k$. The channel $P_l$ is denoted by $P_l = \sum_{k=1}^{n} p_k C_{A_k}$.

The channel $P_l$ can be seen as a box where we have a collection of matrices. At each channel
use, a matrix $A_k$ from the box is chosen randomly according to the probabilities $p_k$. The output
of the channel is $A_k^T \vec{x}$, together with the index k (so the receiver knows which matrix has been
used).
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6.1 Chciracterizing non-losing channels 

In the case of channels that are combination of Hnear channels, we are interested in finding in 
the channels whose capacity region is preserved upon the polarization process. 

Proposition 4. If $\{A_k, A'_k : 1 \le k \le n\}$ is a set of matrices such that $\mathrm{span}(A_k) = \mathrm{span}(A'_k)$
$\forall k \in \{1, \ldots, n\}$, then the two channels $P = \sum_{k=1}^{n} p_k C_{A_k}$ and $P' = \sum_{k=1}^{n} p_k C_{A'_k}$ are equivalent.

Proof. If $\mathrm{span}(A_k) = \mathrm{span}(A'_k)$, we can determine $A_k^T x$ from $A'^{T}_k x$ and vice versa. Therefore,
from the output of $P$, we can deterministically obtain the output of $P'$ and vice versa. In this
sense, $P$ and $P'$ are equivalent, and have the same capacity region. □

Notation 2. Motivated by the above proposition, we will write $P = \sum_{k=1}^{n} p_k C_{V_k}$, where
$\{V_k : 1 \le k \le n\}$ is a set of $n$ subspaces of $\mathbb{F}_q^m$, whenever $P$ is equivalent to
$\sum_{k=1}^{n} p_k C_{A_k}$ and $\mathrm{span}(A_k) = V_k$.

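Proposition 5. If $P = \sum_{k=1}^{n} p_k C_{V_k}$, then $I[S](P) = \sum_{k=1}^{n} p_k \dim(\mathrm{proj}_S(V_k))$ for all $S \subset \{1, \ldots, m\}$,
where $\mathrm{proj}_S$ denotes the canonical projection on $\mathbb{F}_q^S$ defined by $\mathrm{proj}_S(x) = \mathrm{proj}_S(x_1, \ldots, x_m) =
(x_{i_1}, \ldots, x_{i_{|S|}})$ for $x = (x_1, \ldots, x_m) \in \mathbb{F}_q^m$ and $S = \{i_1, \ldots, i_{|S|}\}$.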

Proof. Let $X_1, \ldots, X_m$ be the input to the channel $\sum_{k=1}^{n} p_k C_{A_k}$ (where $A_k$ spans $V_k$), and let
$K, Y$ be its output. We have:
\[ \begin{aligned}
H(X(S) \mid K, Y, X(S^c)) &= \sum_{k,y} \Pr(k,y)\, H(X(S) \mid k, y, X(S^c)) \\
&= \sum_{k,y} \sum_{x} \Pr(k,y \mid x) \Pr(x)\, H(X(S) \mid k, y, X(S^c)) \\
&= \sum_{k,y} \sum_{x :\, A_k^T x = y} p_k \Pr(x)\, H(X(S) \mid k, y, X(S^c)) \\
&= \sum_{k} p_k\, H(X(S) \mid A_k^T X, X(S^c)) \\
&= \sum_{k} p_k\, H(X(S) \mid A_k(S)^T X(S), X(S^c)) \\
&= \sum_{k} p_k\, H(X(S) \mid A_k(S)^T X(S))
\end{aligned} \]
where $A_k(S)$ is obtained from $A_k$ by taking the rows corresponding to $S$. For a given
value of $A_k(S)^T X(S)$, we have $q^{d_k}$ possible values of $X(S)$ with equal probabilities, where
$d_k$ is the dimension of the null space of the mapping $X(S) \to A_k(S)^T X(S)$, so we have
$H(X(S) \mid A_k(S)^T X(S)) = d_k$.

On the other hand, $|S| - H(X(S) \mid A_k(S)^T X(S)) = |S| - d_k$ is the dimension of the range
space of the mapping $X(S) \to A_k(S)^T X(S)$, which is also equal to the rank of $A_k(S)^T$.
Therefore, we have:
\[ \begin{aligned}
|S| - H(X(S) \mid A_k(S)^T X(S)) &= \mathrm{rank}(A_k(S)^T) = \mathrm{rank}(A_k(S)) = \dim(\mathrm{span}(A_k(S))) \\
&= \dim(\mathrm{span}(\mathrm{proj}_S(A_k))) = \dim(\mathrm{proj}_S(\mathrm{span}(A_k))) \\
&= \dim(\mathrm{proj}_S(V_k))
\end{aligned} \]

We conclude:
\[ \begin{aligned}
I(X(S); K, Y, X(S^c)) &= H(X(S)) - H(X(S) \mid K, Y, X(S^c)) \\
&= |S| - \sum_{k} p_k\, H(X(S) \mid A_k(S)^T X(S)) \\
&= \sum_{k} p_k \bigl(|S| - H(X(S) \mid A_k(S)^T X(S))\bigr) \\
&= \sum_{k} p_k (|S| - d_k) = \sum_{k} p_k \dim(\mathrm{proj}_S(V_k))
\end{aligned} \]

□
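The formula $I[S](P) = \sum_k p_k\, \mathrm{rank}(A_k(S))$ obtained in the proof can be evaluated numerically. The following Python sketch does so with Gaussian elimination modulo a prime $q$; the helper names `rank_mod_q` and `mutual_information`, the row-list matrix layout, and the example probabilities are assumptions made for illustration. Rates are in units of $\log q$.

```python
def rank_mod_q(rows, q):
    """Rank over F_q (q prime) of a matrix given as a list of rows."""
    rows = [[v % q for v in r] for r in rows]
    n_cols = len(rows[0]) if rows else 0
    rank = 0
    for c in range(n_cols):
        # Find a row at or below position `rank` with a nonzero entry in column c.
        pivot = next((i for i in range(rank, len(rows)) if rows[i][c]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        inv = pow(rows[rank][c], -1, q)  # modular inverse, Python 3.8+
        rows[rank] = [(v * inv) % q for v in rows[rank]]
        for i in range(len(rows)):
            if i != rank and rows[i][c]:
                f = rows[i][c]
                rows[i] = [(a - f * b) % q for a, b in zip(rows[i], rows[rank])]
        rank += 1
    return rank

def mutual_information(S, matrices, probs, q):
    """I[S](P) = sum_k p_k * rank(A_k(S)); S holds 1-indexed user labels."""
    total = 0.0
    for A, p in zip(matrices, probs):
        A_S = [A[i - 1] for i in sorted(S)]  # rows of A_k belonging to S
        total += p * rank_mod_q(A_S, q)
    return total

# Hypothetical binary 2-user example; the columns of each matrix span a subspace of F_2^2.
matrices = [
    [[], []],          # the zero subspace (no columns)
    [[1], [0]],        # span{(1,0)}
    [[0], [1]],        # span{(0,1)}
    [[1], [1]],        # span{(1,1)}
    [[1, 0], [0, 1]],  # the whole space
]
probs = [0.2, 0.1, 0.1, 0.4, 0.2]
print(mutual_information({1}, matrices, probs, 2))
print(mutual_information({1, 2}, matrices, probs, 2))
```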

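Proposition 6. If $P = \sum_{k=1}^{n} p_k C_{V_k}$ then:

• $P^- = \sum_{k_1=1}^{n} \sum_{k_2=1}^{n} p_{k_1} p_{k_2} C_{V_{k_1} \cap V_{k_2}}$

• $P^+ = \sum_{k_1=1}^{n} \sum_{k_2=1}^{n} p_{k_1} p_{k_2} C_{V_{k_1} + V_{k_2}}$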

Proof. Suppose without loss of generality that $P = \sum_{k=1}^{n} p_k C_{A_k}$ where $A_k$ spans $V_k$. Let $U_1$ be
an arbitrarily distributed random vector in $\mathbb{F}_q^m$ (not necessarily uniform), and let $U_2$ be a uniformly
distributed random vector in $\mathbb{F}_q^m$ independent of $U_1$. Let $X_1 = U_1 + U_2$ and $X_2 = U_2$. Let
$(K_1, A_{K_1}^T X_1)$ and $(K_2, A_{K_2}^T X_2)$ be the output of $P$ when the input is $X_1$ and $X_2$ respectively.
Then the channel $U_1 \to (K_1, A_{K_1}^T X_1, K_2, A_{K_2}^T X_2)$ is equivalent to $P^-$ with $U_1$ as input. We did
not put any constraint on the distribution of $U_1$ (such as saying that $U_1$ is uniform) because in
general, the model of a channel is characterized by its conditional probabilities and no
assumption is made on the input probabilities.

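Fix $K_1 = k_1$ and $K_2 = k_2$, and let $A_{k_1 \wedge k_2}$, $B_{k_1}$, $B_{k_2}$ be three matrices chosen such that $A_{k_1 \wedge k_2}$
spans $V_{k_1} \cap V_{k_2}$, $A_{k_1} = [A_{k_1 \wedge k_2}\ B_{k_1}]$ spans $V_{k_1}$, $A_{k_2} = [A_{k_1 \wedge k_2}\ B_{k_2}]$ spans $V_{k_2}$ and the columns
of $[A_{k_1 \wedge k_2}\ B_{k_1}\ B_{k_2}]$ are linearly independent. Then knowing $A_{k_1}^T X_1$ and $A_{k_2}^T X_2$ is equivalent to
knowing $A_{k_1 \wedge k_2}^T (U_1 + U_2)$, $B_{k_1}^T (U_1 + U_2)$, $A_{k_1 \wedge k_2}^T U_2$ and $B_{k_2}^T U_2$, which is equivalent to knowing
$T^{(1)}_{k_1,k_2} = A_{k_1 \wedge k_2}^T U_1$, $T^{(2)}_{k_1,k_2} = B_{k_1}^T (U_1 + U_2)$ and $T^{(3)}_{k_1,k_2} = [A_{k_1 \wedge k_2}\ B_{k_2}]^T U_2$. We conclude that $P^-$
is equivalent to the channel:
\[ U_1 \to \bigl(K_1, K_2, T^{(1)}_{K_1,K_2}, T^{(2)}_{K_1,K_2}, T^{(3)}_{K_1,K_2}\bigr) \]

Conditioned on $(K_1, K_2, T^{(1)}_{K_1,K_2})$, the vector $[B_{K_1}\ A_{K_1 \wedge K_2}\ B_{K_2}]^T U_2$ is uniform (since the matrix
$[B_{K_1}\ A_{K_1 \wedge K_2}\ B_{K_2}]$ is full rank) and independent of $U_1$, so $[A_{K_1 \wedge K_2}\ B_{K_2}]^T U_2$ is independent
of $(B_{K_1}^T U_2, U_1)$, which implies that $[A_{K_1 \wedge K_2}\ B_{K_2}]^T U_2$ is independent of $(B_{K_1}^T (U_1 + U_2), U_1)$.
Also conditioned on $(K_1, K_2, T^{(1)}_{K_1,K_2})$, $B_{K_1}^T U_2$ is uniform and independent of $U_1$, which implies
that $U_1$ is independent of $B_{K_1}^T (U_1 + U_2)$, and this is because the columns of $B_{K_1}$ and
$A_{K_1 \wedge K_2}$ are linearly independent. We conclude that conditioned on $(K_1, K_2, T^{(1)}_{K_1,K_2})$, $U_1$ is
independent of $(T^{(2)}_{K_1,K_2}, T^{(3)}_{K_1,K_2})$. Therefore, $(K_1, K_2, T^{(1)}_{K_1,K_2}) = (K_1, K_2, A_{K_1 \wedge K_2}^T U_1)$ form
sufficient statistics. We conclude that $P^-$ is equivalent to the channel:
\[ U_1 \to \bigl(K_1, K_2, A_{K_1 \wedge K_2}^T U_1\bigr) \]

And since $\Pr(K_1 = k_1, K_2 = k_2) = p_{k_1} p_{k_2}$ and $A_{k_1 \wedge k_2}$ spans $V_{k_1} \cap V_{k_2}$, we conclude that
$P^- = \sum_{k_1=1}^{n} \sum_{k_2=1}^{n} p_{k_1} p_{k_2} C_{V_{k_1} \cap V_{k_2}}$.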
Now let $U_2$ be arbitrarily distributed in $\mathbb{F}_q^m$ (not necessarily uniform) and $U_1$ be a uniformly
distributed random vector in $\mathbb{F}_q^m$ independent of $U_2$. Let $X_1 = U_1 + U_2$ and $X_2 = U_2$. Let
$(K_1, A_{K_1}^T X_1)$ and $(K_2, A_{K_2}^T X_2)$ be the output of $P$ when the input is $X_1$ and $X_2$ respectively.
Then the channel $U_2 \to (K_1, A_{K_1}^T X_1, K_2, A_{K_2}^T X_2, U_1)$ is equivalent to $P^+$ with $U_2$ as input.
Note that the uniform distribution constraint is now on $U_1$ and no constraint is put on the
distribution of $U_2$, since now $U_2$ is the input to the channel $P^+$.

Knowing $A_{K_1}^T X_1$, $A_{K_2}^T X_2$ and $U_1$ is equivalent to knowing $A_{K_1}^T (U_1 + U_2)$, $A_{K_2}^T U_2$ and $U_1$,
which is equivalent to knowing $A_{K_1}^T U_2$, $A_{K_2}^T U_2$ and $U_1$. So $P^+$ is equivalent to the channel:
\[ U_2 \to \bigl(K_1, K_2, [A_{K_1}\ A_{K_2}]^T U_2, U_1\bigr) \]

And since $U_1$ is independent of $U_2$, the above channel (and hence $P^+$) is equivalent to
the channel:
\[ U_2 \to \bigl(K_1, K_2, [A_{K_1}\ A_{K_2}]^T U_2\bigr) \]

We also have $\Pr(K_1 = k_1, K_2 = k_2) = p_{k_1} p_{k_2}$, and $[A_{k_1}\ A_{k_2}]$ spans $V_{k_1} + V_{k_2}$. We conclude
that $P^+ = \sum_{k_1=1}^{n} \sum_{k_2=1}^{n} p_{k_1} p_{k_2} C_{V_{k_1} + V_{k_2}}$. □
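The statement of Proposition 6 (pairwise intersections for $P^-$, pairwise sums for $P^+$) translates directly into a small computation. The sketch below is an illustration only: it assumes a tiny field and dimension so that each subspace can be stored as an explicit set of vectors, and the names `span`, `subspace_sum` and `polarize` are hypothetical.

```python
from collections import defaultdict
from itertools import product

def span(generators, q, m):
    """All F_q-linear combinations of the generators, as a frozenset of m-tuples."""
    vectors = set()
    for coeffs in product(range(q), repeat=len(generators)):
        v = tuple(sum(c * g[i] for c, g in zip(coeffs, generators)) % q
                  for i in range(m))
        vectors.add(v)
    return frozenset(vectors)

def subspace_sum(V, W, q):
    """The subspace V + W = {v + w : v in V, w in W}."""
    return frozenset(tuple((a + b) % q for a, b in zip(v, w)) for v in V for w in W)

def polarize(channel, q):
    """channel maps each subspace V_k to its probability p_k.

    Returns (P_minus, P_plus) in the same representation, following the
    pairwise intersection / pairwise sum rule.
    """
    minus, plus = defaultdict(float), defaultdict(float)
    for (V1, p1), (V2, p2) in product(channel.items(), repeat=2):
        minus[V1 & V2] += p1 * p2               # intersection of subspaces
        plus[subspace_sum(V1, V2, q)] += p1 * p2
    return dict(minus), dict(plus)

# Hypothetical example over F_2^2: equal mix of span{(1,0)} and span{(1,1)}.
Va = span([(1, 0)], 2, 2)
Vb = span([(1, 1)], 2, 2)
P_minus, P_plus = polarize({Va: 0.5, Vb: 0.5}, 2)
for name, ch in (("P-", P_minus), ("P+", P_plus)):
    print(name, {tuple(sorted(V)): p for V, p in ch.items()})
```

Storing subspaces as explicit vector sets keeps intersection and sum trivial, at the cost of scaling as $q^m$; a basis-based representation would be needed for larger parameters.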

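Lemma 7. Let $P = \sum_{k=1}^{n} p_k C_{V_k}$ and $S \subset \{1, \ldots, m\}$, then
\[ \frac{1}{2}\bigl(I[S](P^-) + I[S](P^+)\bigr) = I[S](P) \iff \bigl(\forall (k_1, k_2);\ \mathrm{proj}_S(V_{k_1} \cap V_{k_2}) = \mathrm{proj}_S(V_{k_1}) \cap \mathrm{proj}_S(V_{k_2})\bigr) \]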



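Proof. We know that if $V$ and $V'$ are two subspaces of $\mathbb{F}_q^m$, then $\mathrm{proj}_S(V \cap V') \subset \mathrm{proj}_S(V) \cap
\mathrm{proj}_S(V')$ and $\mathrm{proj}_S(V + V') = \mathrm{proj}_S(V) + \mathrm{proj}_S(V')$, which implies that:
\[ \begin{aligned}
\dim(\mathrm{proj}_S(V \cap V')) &\le \dim(\mathrm{proj}_S(V) \cap \mathrm{proj}_S(V')) \\
\dim(\mathrm{proj}_S(V + V')) &= \dim(\mathrm{proj}_S(V) + \mathrm{proj}_S(V'))
\end{aligned} \]

We conclude:
\[ \begin{aligned}
\dim(\mathrm{proj}_S(V \cap V')) + \dim(\mathrm{proj}_S(V + V')) &\le \dim(\mathrm{proj}_S(V) \cap \mathrm{proj}_S(V')) + \dim(\mathrm{proj}_S(V) + \mathrm{proj}_S(V')) \\
&= \dim(\mathrm{proj}_S(V)) + \dim(\mathrm{proj}_S(V'))
\end{aligned} \]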

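Therefore:
\[ \begin{aligned}
\frac{1}{2}\bigl(I[S](P^-) + I[S](P^+)\bigr)
&= \frac{1}{2}\Bigl(\sum_{k_1=1}^{n} \sum_{k_2=1}^{n} p_{k_1} p_{k_2} \dim(\mathrm{proj}_S(V_{k_1} \cap V_{k_2})) + \sum_{k_1=1}^{n} \sum_{k_2=1}^{n} p_{k_1} p_{k_2} \dim(\mathrm{proj}_S(V_{k_1} + V_{k_2}))\Bigr) \\
&= \frac{1}{2}\Bigl(\sum_{k_1=1}^{n} \sum_{k_2=1}^{n} p_{k_1} p_{k_2} \bigl(\dim(\mathrm{proj}_S(V_{k_1} \cap V_{k_2})) + \dim(\mathrm{proj}_S(V_{k_1} + V_{k_2}))\bigr)\Bigr) \\
&\le \frac{1}{2}\Bigl(\sum_{k_1=1}^{n} \sum_{k_2=1}^{n} p_{k_1} p_{k_2} \bigl(\dim(\mathrm{proj}_S(V_{k_1})) + \dim(\mathrm{proj}_S(V_{k_2}))\bigr)\Bigr) \\
&= \frac{1}{2}\Bigl(\sum_{k_1=1}^{n} p_{k_1} \dim(\mathrm{proj}_S(V_{k_1})) + \sum_{k_2=1}^{n} p_{k_2} \dim(\mathrm{proj}_S(V_{k_2}))\Bigr) = I[S](P)
\end{aligned} \]

So if we have $\mathrm{proj}_S(V_{k_1} \cap V_{k_2}) \subsetneq \mathrm{proj}_S(V_{k_1}) \cap \mathrm{proj}_S(V_{k_2})$ for some $k_1, k_2$, then we have
$\dim(\mathrm{proj}_S(V_{k_1} \cap V_{k_2})) < \dim(\mathrm{proj}_S(V_{k_1}) \cap \mathrm{proj}_S(V_{k_2}))$, and the above inequality of mutual
information will be strict. We conclude that:
\[ \frac{1}{2}\bigl(I[S](P^-) + I[S](P^+)\bigr) = I[S](P) \iff \bigl(\forall (k_1, k_2);\ \mathrm{proj}_S(V_{k_1} \cap V_{k_2}) = \mathrm{proj}_S(V_{k_1}) \cap \mathrm{proj}_S(V_{k_2})\bigr) \]

□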
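As an illustration of the lemma (a worked example chosen here, not taken from the report), consider the binary 2-user case with $S = \{1\}$ and the channel $P = \frac{1}{2} C_{V} + \frac{1}{2} C_{V'}$, where $V = \{(0,0),(1,0)\}$ and $V' = \{(0,0),(1,1)\}$. Using the expressions of $P^-$ and $P^+$ from Proposition 6 and the formula of Proposition 5:

\[ \begin{aligned}
V \cap V' &= \{(0,0)\}, \qquad V + V' = \mathbb{F}_2^2, \\
P^- &= \tfrac{1}{4} C_{V} + \tfrac{1}{4} C_{V'} + \tfrac{1}{2} C_{\{(0,0)\}}, \qquad
P^+ = \tfrac{1}{4} C_{V} + \tfrac{1}{4} C_{V'} + \tfrac{1}{2} C_{\mathbb{F}_2^2}, \\
I[1](P) &= \tfrac{1}{2} \cdot 1 + \tfrac{1}{2} \cdot 1 = 1, \\
I[1](P^-) &= \tfrac{1}{4} + \tfrac{1}{4} + \tfrac{1}{2} \cdot 0 = \tfrac{1}{2}, \qquad
I[1](P^+) = \tfrac{1}{4} + \tfrac{1}{4} + \tfrac{1}{2} \cdot 1 = 1, \\
\tfrac{1}{2}\bigl(I[1](P^-) + I[1](P^+)\bigr) &= \tfrac{3}{4} < 1 = I[1](P).
\end{aligned} \]

The loss is explained by the failure of the projection condition: $\mathrm{proj}_{\{1\}}(V \cap V') = \{0\}$, whereas $\mathrm{proj}_{\{1\}}(V) \cap \mathrm{proj}_{\{1\}}(V') = \mathbb{F}_2$.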

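Definition 10. Let $\mathcal{V}$ be a set of subspaces of $\mathbb{F}_q^m$. We define the closure of $\mathcal{V}$, $\mathrm{cl}(\mathcal{V})$, as being
the minimal set of subspaces of $\mathbb{F}_q^m$ closed under the two operations $\cap$ and $+$, and including $\mathcal{V}$.
We say that the set $\mathcal{V}$ is consistent with respect to $S \subset \{1, \ldots, m\}$ if and only if it satisfies the
following property:
\[ \forall (V_1, V_2) \in \mathrm{cl}(\mathcal{V})^2;\ \mathrm{proj}_S(V_1 \cap V_2) = \mathrm{proj}_S(V_1) \cap \mathrm{proj}_S(V_2) \]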

Corollary 2. If $\mathcal{V} = \{V_k : 1 \le k \le n\}$, then $I[S](P)$ is preserved upon the polarization process if
and only if $\mathcal{V}$ is consistent with respect to $S$.

Proof. Upon the polarization process, we are performing successively the $\cap$ and $+$ operators,
which means that we'll reach the closure of $\mathcal{V}$ after a finite number of steps. So $I[S](P)$ is
preserved if and only if the above lemma applies to $\mathrm{cl}(\mathcal{V})$. □
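The closure and the consistency test can be computed by brute force when the field and dimension are small. The sketch below is an illustration under that assumption: subspaces are stored as explicit sets of vectors, and the function names `closure`, `proj` and `is_consistent` are hypothetical.

```python
from itertools import product

def subspace_sum(V, W, q):
    """The subspace V + W = {v + w : v in V, w in W} over F_q."""
    return frozenset(tuple((a + b) % q for a, b in zip(v, w)) for v in V for w in W)

def closure(subspaces, q):
    """Smallest set containing `subspaces` and closed under intersection and sum."""
    closed = set(subspaces)
    while True:
        new = set()
        for V, W in product(closed, repeat=2):
            new.add(V & W)
            new.add(subspace_sum(V, W, q))
        if new <= closed:      # nothing new was produced: fixed point reached
            return closed
        closed |= new

def proj(V, S):
    """Canonical projection of V on the coordinates in S (1-indexed)."""
    return frozenset(tuple(v[i - 1] for i in sorted(S)) for v in V)

def is_consistent(subspaces, S, q):
    """Check proj_S(V1 & V2) == proj_S(V1) & proj_S(V2) for all pairs in the closure."""
    cl = closure(subspaces, q)
    return all(proj(V & W, S) == proj(V, S) & proj(W, S)
               for V, W in product(cl, repeat=2))

# Hypothetical example over F_2^2.
Vx = frozenset({(0, 0), (1, 0)})   # span{(1,0)}
Vy = frozenset({(0, 0), (0, 1)})   # span{(0,1)}
Vd = frozenset({(0, 0), (1, 1)})   # span{(1,1)}
print(is_consistent([Vx, Vy], {1}, 2))
print(is_consistent([Vx, Vd], {1}, 2))
```

The loop terminates because there are finitely many subspaces of $\mathbb{F}_q^m$ and the set `closed` only grows.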

The above corollary gives a characterization for a combination of linear channels to preserve
$I[S](P)$. However, this characterization involves the closure operator. The next proposition
gives a sufficient condition that uses only the initial configuration of subspaces $\mathcal{V}$. This
proposition gives a "geometric" view of what the subspaces should look like if we do not
want to lose.



Proposition 7. If there exists a subspace $V_S$ of dimension $|S|$ whose projection on $S$ is $\mathbb{F}_q^S$ (i.e.
$\mathrm{proj}_S(V_S) = \mathbb{F}_q^S$), such that for every $V \in \mathcal{V}$ we have $\mathrm{proj}_S(V_S \cap V) = \mathrm{proj}_S(V)$, then $I[S](P)$
is preserved upon the polarization process. In other words, if every subspace in $\mathcal{V}$ passes through
$V_S$ "orthogonally" to $S$, then $I[S](P)$ is preserved upon the polarization process.

Proof. Let $V_S$ be a subspace satisfying the hypothesis; then it also satisfies the hypothesis if we
replace $\mathcal{V}$ by its closure. Indeed, if $V_1$ and $V_2$ are two arbitrary subspaces satisfying
\[ \mathrm{proj}_S(V_S \cap V_1) = \mathrm{proj}_S(V_1) \quad \text{and} \quad \mathrm{proj}_S(V_S \cap V_2) = \mathrm{proj}_S(V_2) \]
then $\mathrm{proj}_S(V_1) \subset \mathrm{proj}_S(V_S \cap (V_1 + V_2))$ and $\mathrm{proj}_S(V_2) \subset \mathrm{proj}_S(V_S \cap (V_1 + V_2))$, which implies
$\mathrm{proj}_S(V_1 + V_2) = \mathrm{proj}_S(V_1) + \mathrm{proj}_S(V_2) \subset \mathrm{proj}_S(V_S \cap (V_1 + V_2))$. Therefore, $\mathrm{proj}_S(V_S \cap (V_1 +
V_2)) = \mathrm{proj}_S(V_1 + V_2)$ since the inverse inclusion is trivial.

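Now let $x \in \mathrm{proj}_S(V_1) \cap \mathrm{proj}_S(V_2)$, then $x \in \mathrm{proj}_S(V_1) = \mathrm{proj}_S(V_1 \cap V_S)$ and similarly
$x \in \mathrm{proj}_S(V_2 \cap V_S)$, which implies that there are two vectors $x_1 \in V_1 \cap V_S$ and $x_2 \in V_2 \cap V_S$ such
that $x = \mathrm{proj}_S(x_1) = \mathrm{proj}_S(x_2)$. And since $\mathrm{proj}_S(V_S) = \mathbb{F}_q^S$ and $\dim(V_S) = |S|$, the mapping
$\mathrm{proj}_S : V_S \to \mathbb{F}_q^S$ is invertible and so $x_1 = x_2$, which implies that $x \in \mathrm{proj}_S(V_1 \cap V_2 \cap V_S)$.
Thus $\mathrm{proj}_S(V_1) \cap \mathrm{proj}_S(V_2) \subset \mathrm{proj}_S(V_1 \cap V_2 \cap V_S) \subset \mathrm{proj}_S(V_1 \cap V_2)$. We conclude that
$\mathrm{proj}_S(V_1) \cap \mathrm{proj}_S(V_2) = \mathrm{proj}_S(V_1 \cap V_2) = \mathrm{proj}_S(V_1 \cap V_2 \cap V_S)$ since the inverse inclusion
$\mathrm{proj}_S(V_1 \cap V_2) \subset \mathrm{proj}_S(V_1) \cap \mathrm{proj}_S(V_2)$ is trivial.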

We conclude that the set of subspaces $V$ satisfying $\mathrm{proj}_S(V \cap V_S) = \mathrm{proj}_S(V)$ is closed
under the two operators $\cap$ and $+$. And since $\mathcal{V}$ is a subset of this set, $\mathrm{cl}(\mathcal{V})$ is a subset as well.
Now let $V_1, V_2 \in \mathrm{cl}(\mathcal{V})$, then $\mathrm{proj}_S(V_S \cap V_1) = \mathrm{proj}_S(V_1)$ and $\mathrm{proj}_S(V_S \cap V_2) = \mathrm{proj}_S(V_2)$. Then
$\mathrm{proj}_S(V_1) \cap \mathrm{proj}_S(V_2) = \mathrm{proj}_S(V_1 \cap V_2)$ as we have seen in the previous paragraph. We conclude
that $\mathcal{V}$ is consistent with respect to $S$ and so $I[S](P)$ is preserved. □
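A small worked example (chosen here for illustration, not taken from the report) shows the sufficient condition at work. Take $q = 2$, $m = 2$, $S = \{1\}$ and $\mathcal{V} = \{V, W\}$ with $V = \{(0,0),(1,0)\}$ and $W = \{(0,0),(0,1)\}$, and try $V_S = V$:

\[ \begin{aligned}
\dim(V_S) &= 1 = |S|, \qquad \mathrm{proj}_{\{1\}}(V_S) = \mathbb{F}_2, \\
\mathrm{proj}_{\{1\}}(V_S \cap V) &= \mathrm{proj}_{\{1\}}(V) = \mathbb{F}_2, \\
\mathrm{proj}_{\{1\}}(V_S \cap W) &= \mathrm{proj}_{\{1\}}(\{(0,0)\}) = \{0\} = \mathrm{proj}_{\{1\}}(W).
\end{aligned} \]

The hypothesis holds, so $I[1](P)$ is preserved for $P = \frac{1}{2} C_V + \frac{1}{2} C_W$. One polarization step confirms this: $I[1](P) = \frac{1}{2}$, while $P^- = \frac{1}{4} C_V + \frac{1}{4} C_W + \frac{1}{2} C_{\{(0,0)\}}$ gives $I[1](P^-) = \frac{1}{4}$ and $P^+ = \frac{1}{4} C_V + \frac{1}{4} C_W + \frac{1}{2} C_{\mathbb{F}_2^2}$ gives $I[1](P^+) = \frac{3}{4}$, whose average is again $\frac{1}{2}$.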

Conjecture 1. The condition in proposition 7 is necessary.

6.2 Total loss in the dominant face

After characterizing the non-losing channels, we are now interested in studying the amount of
loss in the capacity region. In order to simplify the problem, we only study it in the case of
a binary-input 2-user MAC; the arguments extend to the general case.

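Since we only have 5 subspaces of $\mathbb{F}_2^2$, we write $P = \sum_{k=0}^{4} p_k C_{V_k}$, where $V_0, \ldots, V_4$ are the 5
possible subspaces of $\mathbb{F}_2^2$:
\[ \begin{aligned}
V_0 &= \{(0,0)\} \\
V_1 &= \{(0,0), (1,0)\} \\
V_2 &= \{(0,0), (0,1)\} \\
V_3 &= \{(0,0), (1,1)\} \\
V_4 &= \{(0,0), (1,0), (0,1), (1,1)\}
\end{aligned} \]

We have $I[1](P) = p_1 + p_3 + p_4$, $I[2](P) = p_2 + p_3 + p_4$ and $I(P) = I[1,2](P) = p_1 + p_2 + p_3 + 2p_4$.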
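For concreteness, here is a numerical instance of these three expressions, with probabilities chosen arbitrarily for illustration. Let $(p_0, p_1, p_2, p_3, p_4) = (0.1, 0.2, 0.3, 0.1, 0.3)$. Then:

\[ \begin{aligned}
I[1](P) &= p_1 + p_3 + p_4 = 0.2 + 0.1 + 0.3 = 0.6, \\
I[2](P) &= p_2 + p_3 + p_4 = 0.3 + 0.1 + 0.3 = 0.7, \\
I(P) &= p_1 + p_2 + p_3 + 2p_4 = 0.2 + 0.3 + 0.1 + 0.6 = 1.2.
\end{aligned} \]

The dominant face is the segment $R_1 + R_2 = 1.2$ with $0.5 \le R_1 \le 0.6$. Its extent is governed by $I[1](P) + I[2](P) - I(P) = p_3 = 0.1$, so the face reduces to a single point exactly when the weight of $V_3$ vanishes.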

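Definition 11. Let $P = \sum_{k=0}^{4} p_k C_{V_k}$ and $s \in \{-,+\}^l$. We write $p_k^s$ to denote the component of
$V_k$ in $P^s$, i.e. we have $P^s = \sum_{k=0}^{4} p_k^s C_{V_k}$.

We denote the average of $p_k^s$ on all possible $s \in \{-,+\}^l$ by $p_k^{(l)}$, i.e. $p_k^{(l)} = \frac{1}{2^l} \sum_{s \in \{-,+\}^l} p_k^s$.

$p_k^{(\infty)}$ is the limit of $p_k^{(l)}$ as $l$ tends to infinity.

We denote the average of $I[1](P^s)$ (resp. $I[2](P^s)$ and $I(P^s)$) on all possible $s \in \{-,+\}^l$
by $I_1^{(l)}$ (resp. $I_2^{(l)}$ and $I^{(l)}$). We have $I_1^{(l)} = p_1^{(l)} + p_3^{(l)} + p_4^{(l)}$, $I_2^{(l)} = p_2^{(l)} + p_3^{(l)} + p_4^{(l)}$ and
$I^{(l)} = p_1^{(l)} + p_2^{(l)} + p_3^{(l)} + 2p_4^{(l)}$. If $l$ tends to infinity we get $I_1^{(\infty)} = p_1^{(\infty)} + p_3^{(\infty)} + p_4^{(\infty)}$,
$I_2^{(\infty)} = p_2^{(\infty)} + p_3^{(\infty)} + p_4^{(\infty)}$ and $I^{(\infty)} = p_1^{(\infty)} + p_2^{(\infty)} + p_3^{(\infty)} + 2p_4^{(\infty)}$.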

Definition 12. We say that we have total loss in the dominant face in the polarization process, 
if the dominant face of the capacity region converges to a single point. 

Remark 2. The symmetric capacity region after $l$ polarization steps is the average of the
symmetric capacity regions of all the channels obtained after $l$ polarization steps ($s \in \{-,+\}^l$).
Therefore, this capacity region is given by:
\[ \mathcal{J}(P^{(l)}) = \bigl\{(R_1, R_2) :\ 0 \le R_1 \le I_1^{(l)},\ 0 \le R_2 \le I_2^{(l)},\ 0 \le R_1 + R_2 \le I^{(l)}\bigr\} \]
The above capacity region converges to the "final capacity region":
\[ \mathcal{J}(P^{(\infty)}) = \bigl\{(R_1, R_2) :\ 0 \le R_1 \le I_1^{(\infty)},\ 0 \le R_2 \le I_2^{(\infty)},\ 0 \le R_1 + R_2 \le I^{(\infty)}\bigr\} \]

The dominant face converges to a single point if and only if $I^{(\infty)} = I_1^{(\infty)} + I_2^{(\infty)}$, which is
equivalent to $p_1^{(\infty)} + p_2^{(\infty)} + p_3^{(\infty)} + 2p_4^{(\infty)} = p_1^{(\infty)} + p_2^{(\infty)} + 2p_3^{(\infty)} + 2p_4^{(\infty)}$. We conclude that we
have total loss in the dominant face if and only if $p_3^{(\infty)} = 0$.

Lemma 8. The order of $p_1$, $p_2$ and $p_3$ remains the same upon the polarization process, e.g. if
$p_1 < p_3 < p_2$ then $p_1^s < p_3^s < p_2^s$, and if $p_2 = p_3 < p_1$ then $p_2^s = p_3^s < p_1^s$ for all $s \in \{-,+\}^l$.

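Proof. We have $P^- = \sum_{k=0}^{4} \sum_{k'=0}^{4} p_k p_{k'} C_{V_k \cap V_{k'}}$ and $P^+ = \sum_{k=0}^{4} \sum_{k'=0}^{4} p_k p_{k'} C_{V_k + V_{k'}}$. Therefore, we have:
\[ \begin{aligned}
p_0^- &= p_0^2 + 2p_0(p_1 + p_2 + p_3 + p_4) + 2(p_1 p_2 + p_2 p_3 + p_1 p_3) \\
p_1^- &= p_1^2 + 2p_1 p_4 \\
p_2^- &= p_2^2 + 2p_2 p_4 \\
p_3^- &= p_3^2 + 2p_3 p_4 \\
p_4^- &= p_4^2 \\
p_0^+ &= p_0^2 \\
p_1^+ &= p_1^2 + 2p_1 p_0 \\
p_2^+ &= p_2^2 + 2p_2 p_0 \\
p_3^+ &= p_3^2 + 2p_3 p_0 \\
p_4^+ &= p_4^2 + 2p_4(p_0 + p_1 + p_2 + p_3) + 2(p_1 p_2 + p_2 p_3 + p_1 p_3)
\end{aligned} \]

We can easily see that the order of $p_1^-$, $p_2^-$ and $p_3^-$ is the same as that of $p_1$, $p_2$ and $p_3$. This
is also true for $p_1^+$, $p_2^+$ and $p_3^+$. By using a simple induction on $l$ we conclude that the order of
$p_1^s$, $p_2^s$ and $p_3^s$ is the same as that of $p_1$, $p_2$ and $p_3$ for all $s \in \{-,+\}^l$. □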
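The ten update formulas of the proof are easy to iterate. The following Python sketch applies them with exact rational arithmetic and checks, on one arbitrarily chosen starting vector, that the order of the components of $V_1$, $V_2$ and $V_3$ is the same for every sign sequence of a given length. The function names `minus`, `plus` and `descendants`, as well as the starting probabilities, are assumptions made for this illustration.

```python
from fractions import Fraction

def minus(p):
    """Components (p0..p4) of P^- from those of P, binary 2-user case."""
    p0, p1, p2, p3, p4 = p
    cross = p1 * p2 + p2 * p3 + p1 * p3
    return (p0 * p0 + 2 * p0 * (p1 + p2 + p3 + p4) + 2 * cross,
            p1 * p1 + 2 * p1 * p4,
            p2 * p2 + 2 * p2 * p4,
            p3 * p3 + 2 * p3 * p4,
            p4 * p4)

def plus(p):
    """Components (p0..p4) of P^+ from those of P, binary 2-user case."""
    p0, p1, p2, p3, p4 = p
    cross = p1 * p2 + p2 * p3 + p1 * p3
    return (p0 * p0,
            p1 * p1 + 2 * p1 * p0,
            p2 * p2 + 2 * p2 * p0,
            p3 * p3 + 2 * p3 * p0,
            p4 * p4 + 2 * p4 * (p0 + p1 + p2 + p3) + 2 * cross)

def descendants(p, l):
    """Component vectors of all 2^l channels P^s with s in {-,+}^l."""
    level = [tuple(p)]
    for _ in range(l):
        level = [f(v) for v in level for f in (minus, plus)]
    return level

# Hypothetical starting point with p1 < p3 < p2.
p = tuple(Fraction(n, 10) for n in (2, 1, 4, 2, 1))
same_order = all(v[1] < v[3] < v[2] for v in descendants(p, 4))
print(same_order)
```

Exact fractions are used so that the strict comparisons are not affected by floating-point rounding when the components become very small.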

Lemma 9. For $k \in \{1, 2, 3\}$, if $\exists k' \in \{1, 2, 3\} \setminus \{k\}$ such that $p_k \le p_{k'}$ then
\[ p_k^{(\infty)} = \lim_{l \to \infty} \frac{1}{2^l} \sum_{s \in \{-,+\}^l} p_k^s = 0 \]
In other words, the component of $V_k$ is killed by that of $V_{k'}$.

Proof. We know from theorem 2 that the channel $P^s$ converges almost surely to a deterministic
linear channel as $l$ tends to infinity (we treat $s$ as being a uniform random variable in $\{-,+\}^l$).
Therefore, the vector $(p_0^s, p_1^s, p_2^s, p_3^s, p_4^s)$ converges almost surely to one of the following vectors:
$(1,0,0,0,0)$, $(0,1,0,0,0)$, $(0,0,1,0,0)$, $(0,0,0,1,0)$ or $(0,0,0,0,1)$. In particular, $p_k^s$ converges
almost surely to 0 or 1.

Since $p_k \le p_{k'}$ then $p_k^s \le p_{k'}^s$ for any $s$, and so $p_k^s$ cannot converge to 1 because otherwise the
limit of $p_{k'}^s$ would also be equal to 1, which is not possible since none of the 5 possible vectors
contains two ones. We conclude that $p_k^s$ converges almost surely to 0, which means that $p_k^{(l)}$ (the
average of $p_k^s$ on all possible $s \in \{-,+\}^l$) converges to 0. Therefore, $p_k^{(\infty)} = 0$. □

Proposition 8. If $p_3 \le \max\{p_1, p_2\}$, then we have total loss in the dominant face.

Proof. If $p_3 \le \max\{p_1, p_2\}$, then by the previous lemma we have $p_3^{(\infty)} = 0$. Therefore, we have
total loss in the dominant face (see remark 2). □
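The decay of the averaged component of $V_3$ can be observed numerically for small $l$. The sketch below is an illustration only: it applies the update formulas for $P^-$ and $P^+$ from the proof of Lemma 8 with exact fractions, and the function names and the starting probabilities (chosen so that $p_3 < \max\{p_1, p_2\}$) are assumptions. It averages the components over all $2^l$ sign sequences and prints the averaged $V_3$ component together with the averaged sum rate.

```python
from fractions import Fraction

def minus(p):
    """Components (p0..p4) of P^- from those of P, binary 2-user case."""
    p0, p1, p2, p3, p4 = p
    cross = p1 * p2 + p2 * p3 + p1 * p3
    return (p0 * p0 + 2 * p0 * (p1 + p2 + p3 + p4) + 2 * cross,
            p1 * p1 + 2 * p1 * p4, p2 * p2 + 2 * p2 * p4,
            p3 * p3 + 2 * p3 * p4, p4 * p4)

def plus(p):
    """Components (p0..p4) of P^+ from those of P, binary 2-user case."""
    p0, p1, p2, p3, p4 = p
    cross = p1 * p2 + p2 * p3 + p1 * p3
    return (p0 * p0, p1 * p1 + 2 * p1 * p0, p2 * p2 + 2 * p2 * p0,
            p3 * p3 + 2 * p3 * p0,
            p4 * p4 + 2 * p4 * (p0 + p1 + p2 + p3) + 2 * cross)

def average_components(p, l):
    """Average of the component vectors over all s in {-,+}^l."""
    level = [tuple(p)]
    for _ in range(l):
        level = [f(v) for v in level for f in (minus, plus)]
    return tuple(sum(v[k] for v in level) / len(level) for k in range(5))

def sum_rate(avg):
    """Averaged sum rate: p1 + p2 + p3 + 2 * p4."""
    return avg[1] + avg[2] + avg[3] + 2 * avg[4]

# Hypothetical starting point with p3 < max(p1, p2).
p = tuple(Fraction(n, 10) for n in (2, 1, 4, 2, 1))
for l in range(7):
    avg = average_components(p, l)
    print(l, float(avg[3]), float(sum_rate(avg)))
```

In this run the averaged sum rate stays constant while the averaged weight of $V_3$ shrinks at every step, which is the behaviour that total loss in the dominant face describes. A finite computation of this kind only illustrates the trend and does not replace the limiting argument.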

Corollary 3. If we do not have total loss in the dominant face then the final capacity region
(to which the capacity region is converging) must be symmetric.

Proof. From the above proposition we conclude that $p_3 > \max\{p_1, p_2\}$ and from lemma 9 we
conclude that $p_1^{(\infty)} = p_2^{(\infty)} = 0$. Thus, $I_1^{(\infty)} = I_2^{(\infty)} = p_3^{(\infty)} + p_4^{(\infty)}$ and the final capacity region
is symmetric. In particular, it contains the "equal-rates" rate vector. □

Conjecture 2. The condition in proposition 8 is necessary for having total loss in the dominant
face, i.e. if $p_3 > \max\{p_1, p_2\}$, then we do not have total loss in the dominant face.



7 Conclusion 

We have seen in this report how we can construct reliable polar codes for any m-user MAC
with inputs in $\mathbb{F}_q$. We have seen that for $\epsilon > 0$ and $\beta < \frac{1}{2}$, a polar code of length $N$ can be
constructed such that its sum rate is within $\epsilon$ of the sum capacity of the channel, and the
probability of error is less than $2^{-N^{\beta}}$.

We have also seen that although the sum capacity is achievable with polar codes, we may
lose some rate vectors from the capacity region upon polarization. We have studied this loss in
the case where the channel is a combination of linear channels, and we derived a
characterization of non-losing channels in this special case. We have also derived a sufficient
condition for having total loss in the dominant face of the capacity region (i.e. the dominant
face converges to a single point) in the case of a binary-input 2-user MAC.

Several questions are still open. The most important one is whether we can find a coding
scheme, based on polar codes, in which the whole symmetric capacity region is achievable.